Overview

Dataset statistics

Number of variables58
Number of observations12540
Missing cells143561
Missing cells (%)19.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.3 MiB
Average record size in memory443.0 B

Variable types

Categorical27
Numeric24
Boolean7

Alerts

msg_patrocinado has constant value "PATROCINADO" Constant
enable_popup has constant value "True" Constant
escalonado has constant value "False" Constant
_id has a high cardinality: 12540 distinct values High cardinality
vencimento has a high cardinality: 1797 distinct values High cardinality
taxa has a high cardinality: 1311 distinct values High cardinality
emissor has a high cardinality: 406 distinct values High cardinality
amortizacao has a high cardinality: 54 distinct values High cardinality
carencia has a high cardinality: 392 distinct values High cardinality
teq has a high cardinality: 1703 distinct values High cardinality
cod_cetip has a high cardinality: 291 distinct values High cardinality
qtdMinima is highly correlated with tipo and 12 other fieldsHigh correlation
incentivada is highly correlated with tipo and 6 other fieldsHigh correlation
juros is highly correlated with tipo and 19 other fieldsHigh correlation
preco is highly correlated with tipo and 11 other fieldsHigh correlation
tir is highly correlated with tipo and 7 other fieldsHigh correlation
vir is highly correlated with tipo and 8 other fieldsHigh correlation
dc is highly correlated with amortizacao and 12 other fieldsHigh correlation
du is highly correlated with amortizacao and 12 other fieldsHigh correlation
rbd is highly correlated with tipo and 19 other fieldsHigh correlation
rbm is highly correlated with tipo and 19 other fieldsHigh correlation
rba is highly correlated with tipo and 20 other fieldsHigh correlation
rbp is highly correlated with amortizacao and 7 other fieldsHigh correlation
rlp is highly correlated with amortizacao and 7 other fieldsHigh correlation
rld is highly correlated with tipo and 22 other fieldsHigh correlation
rlm is highly correlated with tipo and 22 other fieldsHigh correlation
rla is highly correlated with tipo and 21 other fieldsHigh correlation
vb is highly correlated with qtdMinima and 10 other fieldsHigh correlation
vl is highly correlated with qtdMinima and 10 other fieldsHigh correlation
vrl is highly correlated with qtdMinima and 10 other fieldsHigh correlation
prlt is highly correlated with amortizacao and 7 other fieldsHigh correlation
rpp is highly correlated with tipo and 9 other fieldsHigh correlation
vpp is highly correlated with tipo and 11 other fieldsHigh correlation
tt is highly correlated with tipo and 20 other fieldsHigh correlation
am is highly correlated with tipo and 14 other fieldsHigh correlation
total is highly correlated with tipo and 19 other fieldsHigh correlation
avista_id is highly correlated with tipo and 8 other fieldsHigh correlation
tipo is highly correlated with subtipo and 34 other fieldsHigh correlation
subtipo is highly correlated with tipo and 12 other fieldsHigh correlation
patrocinado is highly correlated with corretora and 4 other fieldsHigh correlation
liquidez is highly correlated with tipo and 26 other fieldsHigh correlation
qualificado is highly correlated with tipo and 2 other fieldsHigh correlation
amortizacao is highly correlated with tipo and 17 other fieldsHigh correlation
rating is highly correlated with tipo and 23 other fieldsHigh correlation
agencia is highly correlated with tipo and 9 other fieldsHigh correlation
corretora is highly correlated with tipo and 30 other fieldsHigh correlation
nr is highly correlated with tipo and 5 other fieldsHigh correlation
a is highly correlated with tipo and 30 other fieldsHigh correlation
investir is highly correlated with corretora and 5 other fieldsHigh correlation
tp_d is highly correlated with tipo and 20 other fieldsHigh correlation
chat is highly correlated with liquidez and 8 other fieldsHigh correlation
url is highly correlated with tipo and 23 other fieldsHigh correlation
tp_mercado is highly correlated with tipo and 14 other fieldsHigh correlation
idx is highly correlated with tipo and 14 other fieldsHigh correlation
rpd is highly correlated with corretora and 4 other fieldsHigh correlation
cores is highly correlated with tipo and 30 other fieldsHigh correlation
logo is highly correlated with tipo and 30 other fieldsHigh correlation
rico_tipo is highly correlated with tipo and 9 other fieldsHigh correlation
xp_tipo is highly correlated with tipo and 9 other fieldsHigh correlation
subtipo has 12534 (> 99.9%) missing values Missing
msg_patrocinado has 12534 (> 99.9%) missing values Missing
qualificado has 6967 (55.6%) missing values Missing
amortizacao has 9505 (75.8%) missing values Missing
carencia has 1006 (8.0%) missing values Missing
rating has 7629 (60.8%) missing values Missing
agencia has 7629 (60.8%) missing values Missing
tp_d has 514 (4.1%) missing values Missing
chat has 304 (2.4%) missing values Missing
enable_popup has 12528 (99.9%) missing values Missing
url has 12265 (97.8%) missing values Missing
rico_tipo has 11703 (93.3%) missing values Missing
xp_tipo has 11173 (89.1%) missing values Missing
cod_cetip has 12231 (97.5%) missing values Missing
escalonado has 12523 (99.9%) missing values Missing
avista_id has 12516 (99.8%) missing values Missing
vir is highly skewed (γ1 = 22.0592276) Skewed
_id is uniformly distributed Uniform
subtipo is uniformly distributed Uniform
cod_cetip is uniformly distributed Uniform
_id has unique values Unique
nr has 2426 (19.3%) zeros Zeros
vir has 3446 (27.5%) zeros Zeros

Reproduction

Analysis started2022-09-18 18:03:59.627456
Analysis finished2022-09-18 18:06:13.285834
Duration2 minutes and 13.66 seconds
Software versionpandas-profiling v3.3.0
Download configurationconfig.json

Variables

_id
Categorical

HIGH CARDINALITY
UNIFORM
UNIQUE

Distinct12540
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size98.1 KiB
{'$oid': '6325e0f59491f55d25c880cb'}
 
1
{'$oid': '632480fb02b43e2511270e4d'}
 
1
{'$oid': '6320a1053f9ffef9754d8100'}
 
1
{'$oid': '6320a1053f9ffef9754d814d'}
 
1
{'$oid': '6320a1053f9ffef9754d8161'}
 
1
Other values (12535)
12535 

Length

Max length36
Median length36
Mean length36
Min length36

Characters and Unicode

Total characters451440
Distinct characters24
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique12540 ?
Unique (%)100.0%

Sample

1st row{'$oid': '6325e0f59491f55d25c880cb'}
2nd row{'$oid': '6325e0f59491f55d25c880cd'}
3rd row{'$oid': '6325e0f59491f55d25c880c9'}
4th row{'$oid': '6325e0f59491f55d25c880ca'}
5th row{'$oid': '6325e0f59491f55d25c880c8'}

Common Values

ValueCountFrequency (%)
{'$oid': '6325e0f59491f55d25c880cb'}1
 
< 0.1%
{'$oid': '632480fb02b43e2511270e4d'}1
 
< 0.1%
{'$oid': '6320a1053f9ffef9754d8100'}1
 
< 0.1%
{'$oid': '6320a1053f9ffef9754d814d'}1
 
< 0.1%
{'$oid': '6320a1053f9ffef9754d8161'}1
 
< 0.1%
{'$oid': '6321f5dbf13b97c71f160671'}1
 
< 0.1%
{'$oid': '6321f5dbf13b97c71f1606ac'}1
 
< 0.1%
{'$oid': '6321f5dbf13b97c71f1606be'}1
 
< 0.1%
{'$oid': '6321f5dbf13b97c71f1606cd'}1
 
< 0.1%
{'$oid': '6321f5dbf13b97c71f1606d2'}1
 
< 0.1%
Other values (12530)12530
99.9%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
oid12540
50.0%
632480fb02b43e2511270f8f1
 
< 0.1%
62f1262520821896c213ad4a1
 
< 0.1%
632480fb02b43e2511270f841
 
< 0.1%
6325e0f59491f55d25c880c91
 
< 0.1%
6325e0f59491f55d25c880ca1
 
< 0.1%
6325e0f59491f55d25c880c81
 
< 0.1%
6325e0f59491f55d25c880cc1
 
< 0.1%
62f1262520821896c213ad331
 
< 0.1%
62f1262520821896c213ad101
 
< 0.1%
Other values (12531)12531
50.0%

Most occurring characters

ValueCountFrequency (%)
'50160
 
11.1%
327662
 
6.1%
626468
 
5.9%
d25851
 
5.7%
224491
 
5.4%
422354
 
5.0%
f22207
 
4.9%
021754
 
4.8%
520054
 
4.4%
119739
 
4.4%
Other values (14)190700
42.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number207943
46.1%
Lowercase Letter130637
28.9%
Other Punctuation62700
 
13.9%
Space Separator12540
 
2.8%
Close Punctuation12540
 
2.8%
Currency Symbol12540
 
2.8%
Open Punctuation12540
 
2.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
327662
13.3%
626468
12.7%
224491
11.8%
422354
10.8%
021754
10.5%
520054
9.6%
119739
9.5%
817949
8.6%
716031
7.7%
911441
5.5%
Lowercase Letter
ValueCountFrequency (%)
d25851
19.8%
f22207
17.0%
a17837
13.7%
b14272
10.9%
e13061
10.0%
i12540
9.6%
o12540
9.6%
c12329
9.4%
Other Punctuation
ValueCountFrequency (%)
'50160
80.0%
:12540
 
20.0%
Space Separator
ValueCountFrequency (%)
12540
100.0%
Close Punctuation
ValueCountFrequency (%)
}12540
100.0%
Currency Symbol
ValueCountFrequency (%)
$12540
100.0%
Open Punctuation
ValueCountFrequency (%)
{12540
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common320803
71.1%
Latin130637
28.9%

Most frequent character per script

Common
ValueCountFrequency (%)
'50160
15.6%
327662
 
8.6%
626468
 
8.3%
224491
 
7.6%
422354
 
7.0%
021754
 
6.8%
520054
 
6.3%
119739
 
6.2%
817949
 
5.6%
716031
 
5.0%
Other values (6)74141
23.1%
Latin
ValueCountFrequency (%)
d25851
19.8%
f22207
17.0%
a17837
13.7%
b14272
10.9%
e13061
10.0%
i12540
9.6%
o12540
9.6%
c12329
9.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII451440
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
'50160
 
11.1%
327662
 
6.1%
626468
 
5.9%
d25851
 
5.7%
224491
 
5.4%
422354
 
5.0%
f22207
 
4.9%
021754
 
4.8%
520054
 
4.4%
119739
 
4.4%
Other values (14)190700
42.2%

tipo
Categorical

HIGH CORRELATION

Distinct14
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size98.1 KiB
CDB
8838 
LCA
1537 
DEB
1052 
LCI
 
550
CRA
 
310
Other values (9)
 
253

Length

Max length10
Median length3
Mean length2.993062201
Min length2

Characters and Unicode

Total characters37533
Distinct characters18
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3 ?
Unique (%)< 0.1%

Sample

1st rowATIVO REAL
2nd rowATIVO REAL
3rd rowATIVO REAL
4th rowATIVO REAL
5th rowCCB

Common Values

ValueCountFrequency (%)
CDB8838
70.5%
LCA1537
 
12.3%
DEB1052
 
8.4%
LCI550
 
4.4%
CRA310
 
2.5%
CRI101
 
0.8%
LF82
 
0.7%
LC43
 
0.3%
LIG11
 
0.1%
RDB8
 
0.1%
Other values (4)8
 
0.1%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
cdb8838
70.5%
lca1537
 
12.3%
deb1052
 
8.4%
lci550
 
4.4%
cra310
 
2.5%
cri101
 
0.8%
lf82
 
0.7%
lc43
 
0.3%
lig11
 
0.1%
rdb8
 
0.1%
Other values (5)13
 
0.1%

Most occurring characters

ValueCountFrequency (%)
C11381
30.3%
B9899
26.4%
D9898
26.4%
L2229
 
5.9%
A1857
 
4.9%
E1058
 
2.8%
I667
 
1.8%
R425
 
1.1%
F83
 
0.2%
G11
 
< 0.1%
Other values (8)25
 
0.1%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter37528
> 99.9%
Space Separator5
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
C11381
30.3%
B9899
26.4%
D9898
26.4%
L2229
 
5.9%
A1857
 
4.9%
E1058
 
2.8%
I667
 
1.8%
R425
 
1.1%
F83
 
0.2%
G11
 
< 0.1%
Other values (7)20
 
0.1%
Space Separator
ValueCountFrequency (%)
5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin37528
> 99.9%
Common5
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
C11381
30.3%
B9899
26.4%
D9898
26.4%
L2229
 
5.9%
A1857
 
4.9%
E1058
 
2.8%
I667
 
1.8%
R425
 
1.1%
F83
 
0.2%
G11
 
< 0.1%
Other values (7)20
 
0.1%
Common
ValueCountFrequency (%)
5
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII37533
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
C11381
30.3%
B9899
26.4%
D9898
26.4%
L2229
 
5.9%
A1857
 
4.9%
E1058
 
2.8%
I667
 
1.8%
R425
 
1.1%
F83
 
0.2%
G11
 
< 0.1%
Other values (8)25
 
0.1%

subtipo
Categorical

HIGH CORRELATION
MISSING
UNIFORM

Distinct6
Distinct (%)100.0%
Missing12534
Missing (%)> 99.9%
Memory size98.1 KiB
CARTEIRA DE PRECATÓRIOS MUNICIPAIS #38/2022
CARTEIRA DE PRECATÓRIOS FEDERAIS ALIMENTARES #39/2022
CARTEIRA FEDERAL #36/2022
OPERAÇÃO PRECATÓRIO ESTADUAL - PE #37/2022
CCB - INCORPORAÇÃO IMOBILIÁRIA RESIDENCIAL - PROJETO CASA IDEAL #04/2022

Length

Max length72
Median length42.5
Mean length45.66666667
Min length25

Characters and Unicode

Total characters274
Distinct characters35
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6 ?
Unique (%)100.0%

Sample

1st rowCARTEIRA DE PRECATÓRIOS MUNICIPAIS #38/2022
2nd rowCARTEIRA DE PRECATÓRIOS FEDERAIS ALIMENTARES #39/2022
3rd rowCARTEIRA FEDERAL #36/2022
4th rowOPERAÇÃO PRECATÓRIO ESTADUAL - PE #37/2022
5th rowCCB - INCORPORAÇÃO IMOBILIÁRIA RESIDENCIAL - PROJETO CASA IDEAL #04/2022

Common Values

ValueCountFrequency (%)
CARTEIRA DE PRECATÓRIOS MUNICIPAIS #38/20221
 
< 0.1%
CARTEIRA DE PRECATÓRIOS FEDERAIS ALIMENTARES #39/20221
 
< 0.1%
CARTEIRA FEDERAL #36/20221
 
< 0.1%
OPERAÇÃO PRECATÓRIO ESTADUAL - PE #37/20221
 
< 0.1%
CCB - INCORPORAÇÃO IMOBILIÁRIA RESIDENCIAL - PROJETO CASA IDEAL #04/20221
 
< 0.1%
ACERVO JJ - MULHERES CONCRETAS #03/20221
 
< 0.1%
(Missing)12534
> 99.9%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
4
 
11.1%
carteira3
 
8.3%
precatórios2
 
5.6%
de2
 
5.6%
ccb1
 
2.8%
concretas1
 
2.8%
mulheres1
 
2.8%
jj1
 
2.8%
acervo1
 
2.8%
04/20221
 
2.8%
Other values (19)19
52.8%

Most occurring characters

ValueCountFrequency (%)
30
 
10.9%
A25
 
9.1%
E25
 
9.1%
R24
 
8.8%
I19
 
6.9%
218
 
6.6%
C15
 
5.5%
O13
 
4.7%
T10
 
3.6%
S10
 
3.6%
Other values (25)85
31.0%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter192
70.1%
Decimal Number36
 
13.1%
Space Separator30
 
10.9%
Other Punctuation12
 
4.4%
Dash Punctuation4
 
1.5%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
A25
13.0%
E25
13.0%
R24
12.5%
I19
9.9%
C15
7.8%
O13
 
6.8%
T10
 
5.2%
S10
 
5.2%
P8
 
4.2%
D7
 
3.6%
Other values (13)36
18.8%
Decimal Number
ValueCountFrequency (%)
218
50.0%
08
22.2%
35
 
13.9%
41
 
2.8%
81
 
2.8%
71
 
2.8%
61
 
2.8%
91
 
2.8%
Other Punctuation
ValueCountFrequency (%)
/6
50.0%
#6
50.0%
Space Separator
ValueCountFrequency (%)
30
100.0%
Dash Punctuation
ValueCountFrequency (%)
-4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin192
70.1%
Common82
29.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
A25
13.0%
E25
13.0%
R24
12.5%
I19
9.9%
C15
7.8%
O13
 
6.8%
T10
 
5.2%
S10
 
5.2%
P8
 
4.2%
D7
 
3.6%
Other values (13)36
18.8%
Common
ValueCountFrequency (%)
30
36.6%
218
22.0%
08
 
9.8%
/6
 
7.3%
#6
 
7.3%
35
 
6.1%
-4
 
4.9%
41
 
1.2%
81
 
1.2%
71
 
1.2%
Other values (2)2
 
2.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII266
97.1%
None8
 
2.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
30
 
11.3%
A25
 
9.4%
E25
 
9.4%
R24
 
9.0%
I19
 
7.1%
218
 
6.8%
C15
 
5.6%
O13
 
4.9%
T10
 
3.8%
S10
 
3.8%
Other values (21)77
28.9%
None
ValueCountFrequency (%)
Ó3
37.5%
Ç2
25.0%
Ã2
25.0%
Á1
 
12.5%

vencimento
Categorical

HIGH CARDINALITY

Distinct1797
Distinct (%)14.3%
Missing0
Missing (%)0.0%
Memory size98.1 KiB
365 dias
 
943
1096 dias
 
450
181 dias
 
448
730 dias
 
414
721 dias
 
330
Other values (1792)
9955 

Length

Max length9
Median length8
Mean length8.324880383
Min length7

Characters and Unicode

Total characters104394
Distinct characters15
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique843 ?
Unique (%)6.7%

Sample

1st row547 dias
2nd row577 dias
3rd row638 dias
4th row790 dias
5th row730 dias

Common Values

ValueCountFrequency (%)
365 dias943
 
7.5%
1096 dias450
 
3.6%
181 dias448
 
3.6%
730 dias414
 
3.3%
721 dias330
 
2.6%
731 dias322
 
2.6%
90 dias310
 
2.5%
1095 dias265
 
2.1%
1080 dias239
 
1.9%
1461 dias238
 
1.9%
Other values (1787)8581
68.4%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
dias12540
50.0%
365943
 
3.8%
1096450
 
1.8%
181448
 
1.8%
730414
 
1.7%
721330
 
1.3%
731322
 
1.3%
90310
 
1.2%
1095265
 
1.1%
1080239
 
1.0%
Other values (1788)8819
35.2%

Most occurring characters

ValueCountFrequency (%)
12540
12.0%
d12540
12.0%
i12540
12.0%
a12540
12.0%
s12540
12.0%
18212
7.9%
04973
 
4.8%
34766
 
4.6%
64439
 
4.3%
23974
 
3.8%
Other values (5)15330
14.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter50160
48.0%
Decimal Number41694
39.9%
Space Separator12540
 
12.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
18212
19.7%
04973
11.9%
34766
11.4%
64439
10.6%
23974
9.5%
73367
8.1%
53276
 
7.9%
92971
 
7.1%
82909
 
7.0%
42807
 
6.7%
Lowercase Letter
ValueCountFrequency (%)
d12540
25.0%
i12540
25.0%
a12540
25.0%
s12540
25.0%
Space Separator
ValueCountFrequency (%)
12540
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common54234
52.0%
Latin50160
48.0%

Most frequent character per script

Common
ValueCountFrequency (%)
12540
23.1%
18212
15.1%
04973
 
9.2%
34766
 
8.8%
64439
 
8.2%
23974
 
7.3%
73367
 
6.2%
53276
 
6.0%
92971
 
5.5%
82909
 
5.4%
Latin
ValueCountFrequency (%)
d12540
25.0%
i12540
25.0%
a12540
25.0%
s12540
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII104394
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
12540
12.0%
d12540
12.0%
i12540
12.0%
a12540
12.0%
s12540
12.0%
18212
7.9%
04973
 
4.8%
34766
 
4.6%
64439
 
4.3%
23974
 
3.8%
Other values (5)15330
14.7%

qtdMinima
Real number (ℝ≥0)

HIGH CORRELATION

Distinct2768
Distinct (%)22.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11100.64517
Minimum0.94
Maximum1000000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size98.1 KiB

Quantile statistics

Minimum0.94
5-th percentile204
Q11041.425168
median10000
Q310000
95-th percentile10000
Maximum1000000
Range999999.06
Interquartile range (IQR)8958.574832

Descriptive statistics

Standard deviation56328.58
Coefficient of variation (CV)5.074351908
Kurtosis216.2744177
Mean11100.64517
Median Absolute Deviation (MAD)5000
Skewness14.15453596
Sum139202090.4
Variance3172908924
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
100006143
49.0%
10001687
 
13.5%
5000504
 
4.0%
20000231
 
1.8%
50000116
 
0.9%
500100
 
0.8%
3000071
 
0.6%
159
 
0.5%
5044
 
0.4%
10044
 
0.4%
Other values (2758)3541
28.2%
ValueCountFrequency (%)
0.942
< 0.1%
0.961
< 0.1%
0.960382671
< 0.1%
0.975395531
< 0.1%
0.976224411
< 0.1%
0.979631881
< 0.1%
0.982067631
< 0.1%
0.98368341
< 0.1%
0.986017031
< 0.1%
0.986529551
< 0.1%
ValueCountFrequency (%)
100000021
0.2%
75000021
0.2%
50000021
0.2%
392178.56321
 
< 0.1%
342895.70461
 
< 0.1%
250976.8971
 
< 0.1%
25000026
0.2%
209539.52831
 
< 0.1%
157803.79841
 
< 0.1%
138622.38581
 
< 0.1%

taxa
Categorical

HIGH CARDINALITY

Distinct1311
Distinct (%)10.5%
Missing0
Missing (%)0.0%
Memory size98.1 KiB
IPCA +9.0%
 
586
15.0%
 
563
115.0% CDI
 
277
100.0% CDI
 
234
109.0% CDI
 
214
Other values (1306)
10666 

Length

Max length12
Median length11
Mean length8.554066986
Min length4

Characters and Unicode

Total characters107268
Distinct characters19
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique397 ?
Unique (%)3.2%

Sample

1st row236.34% CDI
2nd row235.52% CDI
3rd row221.5% CDI
4th row213.0% CDI
5th row189.0% CDI

Common Values

ValueCountFrequency (%)
IPCA +9.0%586
 
4.7%
15.0%563
 
4.5%
115.0% CDI277
 
2.2%
100.0% CDI234
 
1.9%
109.0% CDI214
 
1.7%
106.0% CDI205
 
1.6%
104.0% CDI191
 
1.5%
112.0% CDI184
 
1.5%
102.0% CDI160
 
1.3%
108.0% CDI153
 
1.2%
Other values (1301)9773
77.9%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
cdi4913
23.6%
ipca3301
 
15.8%
9.0586
 
2.8%
15.0563
 
2.7%
115.0277
 
1.3%
100.0234
 
1.1%
109.0214
 
1.0%
106.0205
 
1.0%
104.0191
 
0.9%
112.0184
 
0.9%
Other values (1270)10173
48.8%

Most occurring characters

ValueCountFrequency (%)
%12540
11.7%
.12540
11.7%
110901
10.2%
08978
 
8.4%
8301
 
7.7%
I8214
 
7.7%
C8214
 
7.7%
55186
 
4.8%
D4913
 
4.6%
+3388
 
3.2%
Other values (9)24093
22.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number42556
39.7%
Uppercase Letter27943
26.0%
Other Punctuation25080
23.4%
Space Separator8301
 
7.7%
Math Symbol3388
 
3.2%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
110901
25.6%
08978
21.1%
55186
12.2%
93361
 
7.9%
22848
 
6.7%
32764
 
6.5%
62467
 
5.8%
42165
 
5.1%
71974
 
4.6%
81912
 
4.5%
Uppercase Letter
ValueCountFrequency (%)
I8214
29.4%
C8214
29.4%
D4913
17.6%
A3301
11.8%
P3301
11.8%
Other Punctuation
ValueCountFrequency (%)
%12540
50.0%
.12540
50.0%
Space Separator
ValueCountFrequency (%)
8301
100.0%
Math Symbol
ValueCountFrequency (%)
+3388
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common79325
74.0%
Latin27943
 
26.0%

Most frequent character per script

Common
ValueCountFrequency (%)
%12540
15.8%
.12540
15.8%
110901
13.7%
08978
11.3%
8301
10.5%
55186
6.5%
+3388
 
4.3%
93361
 
4.2%
22848
 
3.6%
32764
 
3.5%
Other values (4)8518
10.7%
Latin
ValueCountFrequency (%)
I8214
29.4%
C8214
29.4%
D4913
17.6%
A3301
11.8%
P3301
11.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII107268
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
%12540
11.7%
.12540
11.7%
110901
10.2%
08978
 
8.4%
8301
 
7.7%
I8214
 
7.7%
C8214
 
7.7%
55186
 
4.8%
D4913
 
4.6%
+3388
 
3.2%
Other values (9)24093
22.5%

msg_patrocinado
Categorical

CONSTANT
MISSING
REJECTED

Distinct1
Distinct (%)16.7%
Missing12534
Missing (%)> 99.9%
Memory size98.1 KiB
PATROCINADO

Length

Max length11
Median length11
Mean length11
Min length11

Characters and Unicode

Total characters66
Distinct characters9
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowPATROCINADO
2nd rowPATROCINADO
3rd rowPATROCINADO
4th rowPATROCINADO
5th rowPATROCINADO

Common Values

ValueCountFrequency (%)
PATROCINADO6
 
< 0.1%
(Missing)12534
> 99.9%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
patrocinado6
100.0%

Most occurring characters

ValueCountFrequency (%)
A12
18.2%
O12
18.2%
P6
9.1%
T6
9.1%
R6
9.1%
C6
9.1%
I6
9.1%
N6
9.1%
D6
9.1%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter66
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
A12
18.2%
O12
18.2%
P6
9.1%
T6
9.1%
R6
9.1%
C6
9.1%
I6
9.1%
N6
9.1%
D6
9.1%

Most occurring scripts

ValueCountFrequency (%)
Latin66
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
A12
18.2%
O12
18.2%
P6
9.1%
T6
9.1%
R6
9.1%
C6
9.1%
I6
9.1%
N6
9.1%
D6
9.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII66
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
A12
18.2%
O12
18.2%
P6
9.1%
T6
9.1%
R6
9.1%
C6
9.1%
I6
9.1%
N6
9.1%
D6
9.1%

patrocinado
Boolean

HIGH CORRELATION

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size12.4 KiB
False
12534 
True
 
6
ValueCountFrequency (%)
False12534
> 99.9%
True6
 
< 0.1%

emissor
Categorical

HIGH CARDINALITY

Distinct406
Distinct (%)3.2%
Missing0
Missing (%)0.0%
Memory size98.1 KiB
SUZANO
1564 
BANCO BTG PACTUAL
1213 
BANCO BMG
1008 
BANCO DAYCOVAL
 
729
BANCO PAN
 
587
Other values (401)
7439 

Length

Max length72
Median length49
Mean length12.78141946
Min length2

Characters and Unicode

Total characters160279
Distinct characters50
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique180 ?
Unique (%)1.4%

Sample

1st rowCARTEIRA DE PRECATÓRIOS MUNICIPAIS #38/2022
2nd rowCARTEIRA DE PRECATÓRIOS FEDERAIS ALIMENTARES #39/2022
3rd rowCARTEIRA FEDERAL #36/2022
4th rowOPERAÇÃO PRECATÓRIO ESTADUAL - PE #37/2022
5th rowCCB - INCORPORAÇÃO IMOBILIÁRIA RESIDENCIAL - PROJETO CASA IDEAL #04/2022

Common Values

ValueCountFrequency (%)
SUZANO1564
 
12.5%
BANCO BTG PACTUAL1213
 
9.7%
BANCO BMG1008
 
8.0%
BANCO DAYCOVAL729
 
5.8%
BANCO PAN587
 
4.7%
BANCO MODAL499
 
4.0%
DE DESENVOLVIMENTO DE MINAS GERAIS480
 
3.8%
BANCO ABC BRASIL480
 
3.8%
BANCO ALFA433
 
3.5%
BANCO PINE317
 
2.5%
Other values (396)5230
41.7%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
banco8160
29.7%
suzano1564
 
5.7%
btg1213
 
4.4%
pactual1213
 
4.4%
de1193
 
4.3%
bmg1008
 
3.7%
brasil832
 
3.0%
daycoval729
 
2.7%
pan587
 
2.1%
desenvolvimento502
 
1.8%
Other values (529)10469
38.1%

Most occurring characters

ValueCountFrequency (%)
A23625
14.7%
O15714
9.8%
N15672
9.8%
14987
9.4%
B14166
 
8.8%
C12584
 
7.9%
E6888
 
4.3%
S6883
 
4.3%
I6771
 
4.2%
L5636
 
3.5%
Other values (40)37353
23.3%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter144478
90.1%
Space Separator14987
 
9.4%
Decimal Number381
 
0.2%
Other Punctuation305
 
0.2%
Dash Punctuation47
 
< 0.1%
Open Punctuation38
 
< 0.1%
Close Punctuation38
 
< 0.1%
Math Symbol5
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
A23625
16.4%
O15714
10.9%
N15672
10.8%
B14166
9.8%
C12584
8.7%
E6888
 
4.8%
S6883
 
4.8%
I6771
 
4.7%
L5636
 
3.9%
R5472
 
3.8%
Other values (21)31067
21.5%
Decimal Number
ValueCountFrequency (%)
6201
52.8%
260
 
15.7%
036
 
9.4%
325
 
6.6%
124
 
6.3%
514
 
3.7%
97
 
1.8%
86
 
1.6%
76
 
1.6%
42
 
0.5%
Other Punctuation
ValueCountFrequency (%)
.266
87.2%
/26
 
8.5%
'7
 
2.3%
#6
 
2.0%
Space Separator
ValueCountFrequency (%)
14987
100.0%
Dash Punctuation
ValueCountFrequency (%)
-47
100.0%
Open Punctuation
ValueCountFrequency (%)
(38
100.0%
Close Punctuation
ValueCountFrequency (%)
)38
100.0%
Math Symbol
ValueCountFrequency (%)
|5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin144478
90.1%
Common15801
 
9.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
A23625
16.4%
O15714
10.9%
N15672
10.8%
B14166
9.8%
C12584
8.7%
E6888
 
4.8%
S6883
 
4.8%
I6771
 
4.7%
L5636
 
3.9%
R5472
 
3.8%
Other values (21)31067
21.5%
Common
ValueCountFrequency (%)
14987
94.8%
.266
 
1.7%
6201
 
1.3%
260
 
0.4%
-47
 
0.3%
(38
 
0.2%
)38
 
0.2%
036
 
0.2%
/26
 
0.2%
325
 
0.2%
Other values (9)77
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII160246
> 99.9%
None33
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
A23625
14.7%
O15714
9.8%
N15672
9.8%
14987
9.4%
B14166
 
8.8%
C12584
 
7.9%
E6888
 
4.3%
S6883
 
4.3%
I6771
 
4.2%
L5636
 
3.5%
Other values (35)37320
23.3%
None
ValueCountFrequency (%)
Ã9
27.3%
Ó9
27.3%
É8
24.2%
Ç5
15.2%
Á2
 
6.1%

liquidez
Categorical

HIGH CORRELATION

Distinct21
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size98.1 KiB
No vencimento
8765 
No Vencimento
3293 
SECUNDARIO
 
252
D+90
 
96
Diária
 
79
Other values (16)
 
55

Length

Max length20
Median length13
Mean length12.79617225
Min length3

Characters and Unicode

Total characters160464
Distinct characters38
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique7 ?
Unique (%)0.1%

Sample

1st rowNo vencimento
2nd rowNo vencimento
3rd rowNo vencimento
4th rowNo vencimento
5th rowNo vencimento

Common Values

ValueCountFrequency (%)
No vencimento8765
69.9%
No Vencimento3293
 
26.3%
SECUNDARIO252
 
2.0%
D+9096
 
0.8%
Diária79
 
0.6%
D+18010
 
0.1%
No vencimento 6
 
< 0.1%
D+155
 
< 0.1%
D+305
 
< 0.1%
D+3605
 
< 0.1%
Other values (11)24
 
0.2%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
no12064
49.0%
vencimento12064
49.0%
secundario252
 
1.0%
d+9096
 
0.4%
diária80
 
0.3%
d+18010
 
< 0.1%
d+7205
 
< 0.1%
d+35
 
< 0.1%
d+185
 
< 0.1%
d+3605
 
< 0.1%
Other values (13)23
 
0.1%

Most occurring characters

ValueCountFrequency (%)
n24130
15.0%
o24130
15.0%
e24128
15.0%
N12316
7.7%
i12227
7.6%
12075
7.5%
c12066
7.5%
m12066
7.5%
t12064
7.5%
v8771
 
5.5%
Other values (28)6491
 
4.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter129837
80.9%
Uppercase Letter18103
 
11.3%
Space Separator12075
 
7.5%
Decimal Number307
 
0.2%
Math Symbol142
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
n24130
18.6%
o24130
18.6%
e24128
18.6%
i12227
9.4%
c12066
9.3%
m12066
9.3%
t12064
9.3%
v8771
 
6.8%
a86
 
0.1%
r82
 
0.1%
Other values (6)87
 
0.1%
Uppercase Letter
ValueCountFrequency (%)
N12316
68.0%
V3293
 
18.2%
D474
 
2.6%
C256
 
1.4%
A252
 
1.4%
I252
 
1.4%
R252
 
1.4%
O252
 
1.4%
U252
 
1.4%
E252
 
1.4%
Decimal Number
ValueCountFrequency (%)
0122
39.7%
996
31.3%
126
 
8.5%
819
 
6.2%
317
 
5.5%
29
 
2.9%
67
 
2.3%
76
 
2.0%
55
 
1.6%
Space Separator
ValueCountFrequency (%)
12075
100.0%
Math Symbol
ValueCountFrequency (%)
+142
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin147940
92.2%
Common12524
 
7.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
n24130
16.3%
o24130
16.3%
e24128
16.3%
N12316
8.3%
i12227
8.3%
c12066
8.2%
m12066
8.2%
t12064
8.2%
v8771
 
5.9%
V3293
 
2.2%
Other values (17)2749
 
1.9%
Common
ValueCountFrequency (%)
12075
96.4%
+142
 
1.1%
0122
 
1.0%
996
 
0.8%
126
 
0.2%
819
 
0.2%
317
 
0.1%
29
 
0.1%
67
 
0.1%
76
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII160381
99.9%
None83
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
n24130
15.0%
o24130
15.0%
e24128
15.0%
N12316
7.7%
i12227
7.6%
12075
7.5%
c12066
7.5%
m12066
7.5%
t12064
7.5%
v8771
 
5.5%
Other values (25)6408
 
4.0%
None
ValueCountFrequency (%)
á80
96.4%
ê2
 
2.4%
ó1
 
1.2%

incentivada
Boolean

HIGH CORRELATION

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size12.4 KiB
False
9192 
True
3348 
ValueCountFrequency (%)
False9192
73.3%
True3348
 
26.7%

qualificado
Boolean

HIGH CORRELATION
MISSING

Distinct2
Distinct (%)< 0.1%
Missing6967
Missing (%)55.6%
Memory size98.1 KiB
False
5426 
True
 
147
(Missing)
6967 
ValueCountFrequency (%)
False5426
43.3%
True147
 
1.2%
(Missing)6967
55.6%

juros
Real number (ℝ≥0)

HIGH CORRELATION

Distinct1277
Distinct (%)10.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean44.84182057
Minimum0
Maximum236.34
Zeros17
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size98.1 KiB

Quantile statistics

Minimum0
5-th percentile5.25
Q19
median13.7
Q3100.25
95-th percentile113
Maximum236.34
Range236.34
Interquartile range (IQR)91.25

Descriptive statistics

Standard deviation45.87722992
Coefficient of variation (CV)1.023090261
Kurtosis-1.545007454
Mean44.84182057
Median Absolute Deviation (MAD)7.6
Skewness0.5770091296
Sum562316.43
Variance2104.720225
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
9586
 
4.7%
15563
 
4.5%
115277
 
2.2%
100234
 
1.9%
109214
 
1.7%
106205
 
1.6%
104191
 
1.5%
112184
 
1.5%
102160
 
1.3%
108153
 
1.2%
Other values (1267)9773
77.9%
ValueCountFrequency (%)
017
0.1%
0.021
 
< 0.1%
0.041
 
< 0.1%
0.073
 
< 0.1%
0.091
 
< 0.1%
0.12
 
< 0.1%
0.122
 
< 0.1%
0.141
 
< 0.1%
0.151
 
< 0.1%
0.183
 
< 0.1%
ValueCountFrequency (%)
236.341
< 0.1%
235.521
< 0.1%
221.51
< 0.1%
2131
< 0.1%
1891
< 0.1%
163.61
< 0.1%
1351
< 0.1%
128.51
< 0.1%
128.251
< 0.1%
1281
< 0.1%

amortizacao
Categorical

HIGH CARDINALITY
HIGH CORRELATION
MISSING

Distinct54
Distinct (%)1.8%
Missing9505
Missing (%)75.8%
Memory size98.1 KiB
Vencimento
2969 
Sem Amortização
 
6
Anual a partir de: 17/06/2030
 
4
Mensal a partir de: 15/08/2024
 
2
Anual a partir de: 15/07/2025
 
2
Other values (49)
 
52

Length

Max length34
Median length10
Mean length10.40428336
Min length10

Characters and Unicode

Total characters31577
Distinct characters35
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique46 ?
Unique (%)1.5%

Sample

1st rowVencimento
2nd rowVencimento
3rd rowVencimento
4th rowVencimento
5th rowVencimento

Common Values

ValueCountFrequency (%)
Vencimento2969
 
23.7%
Sem Amortização6
 
< 0.1%
Anual a partir de: 17/06/20304
 
< 0.1%
Mensal a partir de: 15/08/20242
 
< 0.1%
Anual a partir de: 15/07/20252
 
< 0.1%
Anual a partir de: 17/05/20272
 
< 0.1%
Anual a partir de: 16/11/20282
 
< 0.1%
Anual a partir de: 17/07/20282
 
< 0.1%
Anual a partir de: 15/08/20231
 
< 0.1%
Anual a partir de: 15/10/20231
 
< 0.1%
Other values (44)44
 
0.4%
(Missing)9505
75.8%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
vencimento2969
90.5%
partir60
 
1.8%
de60
 
1.8%
a60
 
1.8%
anual43
 
1.3%
semestral8
 
0.2%
amortização6
 
0.2%
sem6
 
0.2%
mensal5
 
0.2%
trimestral4
 
0.1%
Other values (50)60
 
1.8%

Most occurring characters

ValueCountFrequency (%)
e6029
19.1%
n5986
19.0%
t3047
9.6%
i3039
9.6%
m2993
9.5%
o2981
9.4%
V2969
9.4%
c2969
9.4%
246
 
0.8%
a186
 
0.6%
Other values (25)1132
 
3.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter27630
87.5%
Uppercase Letter3041
 
9.6%
Decimal Number480
 
1.5%
Space Separator246
 
0.8%
Other Punctuation180
 
0.6%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e6029
21.8%
n5986
21.7%
t3047
11.0%
i3039
11.0%
m2993
10.8%
o2981
10.8%
c2969
10.7%
a186
 
0.7%
r142
 
0.5%
p60
 
0.2%
Other values (7)198
 
0.7%
Decimal Number
ValueCountFrequency (%)
0124
25.8%
2118
24.6%
184
17.5%
555
11.5%
722
 
4.6%
620
 
4.2%
317
 
3.5%
816
 
3.3%
414
 
2.9%
910
 
2.1%
Uppercase Letter
ValueCountFrequency (%)
V2969
97.6%
A49
 
1.6%
S14
 
0.5%
M5
 
0.2%
T4
 
0.1%
Other Punctuation
ValueCountFrequency (%)
/120
66.7%
:60
33.3%
Space Separator
ValueCountFrequency (%)
246
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin30671
97.1%
Common906
 
2.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
e6029
19.7%
n5986
19.5%
t3047
9.9%
i3039
9.9%
m2993
9.8%
o2981
9.7%
V2969
9.7%
c2969
9.7%
a186
 
0.6%
r142
 
0.5%
Other values (12)330
 
1.1%
Common
ValueCountFrequency (%)
246
27.2%
0124
13.7%
/120
13.2%
2118
13.0%
184
 
9.3%
:60
 
6.6%
555
 
6.1%
722
 
2.4%
620
 
2.2%
317
 
1.9%
Other values (3)40
 
4.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII31565
> 99.9%
None12
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e6029
19.1%
n5986
19.0%
t3047
9.7%
i3039
9.6%
m2993
9.5%
o2981
9.4%
V2969
9.4%
c2969
9.4%
246
 
0.8%
a186
 
0.6%
Other values (23)1120
 
3.5%
None
ValueCountFrequency (%)
ã6
50.0%
ç6
50.0%

carencia
Categorical

HIGH CARDINALITY
MISSING

Distinct392
Distinct (%)3.4%
Missing1006
Missing (%)8.0%
Memory size98.1 KiB
Sem carência
9056 
D+0
 
514
Vencimento
 
375
0
 
183
721 dias dias
 
47
Other values (387)
1359 

Length

Max length20
Median length12
Mean length11.08704699
Min length1

Characters and Unicode

Total characters127878
Distinct characters40
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique201 ?
Unique (%)1.7%

Sample

1st rowNo vencimento
2nd rowNo vencimento
3rd rowNo vencimento
4th rowNo vencimento
5th rowNo vencimento

Common Values

ValueCountFrequency (%)
Sem carência9056
72.2%
D+0514
 
4.1%
Vencimento375
 
3.0%
0183
 
1.5%
721 dias dias47
 
0.4%
1080 dias41
 
0.3%
1080 dias dias37
 
0.3%
72129
 
0.2%
108026
 
0.2%
361 dias dias25
 
0.2%
Other values (382)1201
 
9.6%
(Missing)1006
 
8.0%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
sem9056
41.3%
carência9056
41.3%
dias1284
 
5.9%
d+0514
 
2.3%
vencimento404
 
1.8%
0183
 
0.8%
1080104
 
0.5%
72199
 
0.5%
144050
 
0.2%
36149
 
0.2%
Other values (321)1106
 
5.0%

Most occurring characters

ValueCountFrequency (%)
a19400
15.2%
c18507
14.5%
i10741
8.4%
10377
8.1%
n9846
7.7%
e9846
7.7%
m9451
7.4%
r9059
7.1%
S9056
7.1%
ê9056
7.1%
Other values (30)12539
9.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter99310
77.7%
Space Separator10377
 
8.1%
Uppercase Letter10081
 
7.9%
Decimal Number6907
 
5.4%
Math Symbol519
 
0.4%
Other Punctuation438
 
0.3%
Dash Punctuation246
 
0.2%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a19400
19.5%
c18507
18.6%
i10741
10.8%
n9846
9.9%
e9846
9.9%
m9451
9.5%
r9059
9.1%
ê9056
9.1%
s1285
 
1.3%
d1284
 
1.3%
Other values (6)835
 
0.8%
Uppercase Letter
ValueCountFrequency (%)
S9056
89.8%
D522
 
5.2%
V384
 
3.8%
N47
 
0.5%
O18
 
0.2%
E18
 
0.2%
C9
 
0.1%
I9
 
0.1%
M9
 
0.1%
T9
 
0.1%
Decimal Number
ValueCountFrequency (%)
02160
31.3%
21274
18.4%
11136
16.4%
3479
 
6.9%
8397
 
5.7%
6327
 
4.7%
9311
 
4.5%
4285
 
4.1%
5281
 
4.1%
7257
 
3.7%
Space Separator
ValueCountFrequency (%)
10377
100.0%
Math Symbol
ValueCountFrequency (%)
+519
100.0%
Other Punctuation
ValueCountFrequency (%)
/438
100.0%
Dash Punctuation
ValueCountFrequency (%)
-246
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin109391
85.5%
Common18487
 
14.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
a19400
17.7%
c18507
16.9%
i10741
9.8%
n9846
9.0%
e9846
9.0%
m9451
8.6%
r9059
8.3%
S9056
8.3%
ê9056
8.3%
s1285
 
1.2%
Other values (16)3144
 
2.9%
Common
ValueCountFrequency (%)
10377
56.1%
02160
 
11.7%
21274
 
6.9%
11136
 
6.1%
+519
 
2.8%
3479
 
2.6%
/438
 
2.4%
8397
 
2.1%
6327
 
1.8%
9311
 
1.7%
Other values (4)1069
 
5.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII118818
92.9%
None9060
 
7.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a19400
16.3%
c18507
15.6%
i10741
9.0%
10377
8.7%
n9846
8.3%
e9846
8.3%
m9451
8.0%
r9059
7.6%
S9056
7.6%
02160
 
1.8%
Other values (27)10375
8.7%
None
ValueCountFrequency (%)
ê9056
> 99.9%
á3
 
< 0.1%
ó1
 
< 0.1%

rating
Categorical

HIGH CORRELATION
MISSING

Distinct26
Distinct (%)0.5%
Missing7629
Missing (%)60.8%
Memory size98.1 KiB
AAA
954 
BB+
512 
A
470 
BBB-
416 
B+
388 
Other values (21)
2171 

Length

Max length6
Median length5
Mean length2.993280391
Min length1

Characters and Unicode

Total characters14700
Distinct characters7
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)< 0.1%

Sample

1st rowbrAAA
2nd rowbrAAA
3rd rowbrAAA
4th rowAAA.br
5th rowbrAAA

Common Values

ValueCountFrequency (%)
AAA954
 
7.6%
BB+512
 
4.1%
A470
 
3.7%
BBB-416
 
3.3%
B+388
 
3.1%
AA377
 
3.0%
brAAA333
 
2.7%
A-282
 
2.2%
BBB+250
 
2.0%
AA+209
 
1.7%
Other values (16)720
 
5.7%
(Missing)7629
60.8%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
aaa954
19.4%
bbb846
17.2%
a811
16.5%
aa663
13.5%
bb533
10.9%
b389
7.9%
braaa333
 
6.8%
bra165
 
3.4%
aaa.br101
 
2.1%
braa91
 
1.9%
Other values (4)25
 
0.5%

Most occurring characters

ValueCountFrequency (%)
A6676
45.4%
B4020
27.3%
+1504
 
10.2%
-949
 
6.5%
b715
 
4.9%
r715
 
4.9%
.121
 
0.8%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter10696
72.8%
Math Symbol1504
 
10.2%
Lowercase Letter1430
 
9.7%
Dash Punctuation949
 
6.5%
Other Punctuation121
 
0.8%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
A6676
62.4%
B4020
37.6%
Lowercase Letter
ValueCountFrequency (%)
b715
50.0%
r715
50.0%
Math Symbol
ValueCountFrequency (%)
+1504
100.0%
Dash Punctuation
ValueCountFrequency (%)
-949
100.0%
Other Punctuation
ValueCountFrequency (%)
.121
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin12126
82.5%
Common2574
 
17.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
A6676
55.1%
B4020
33.2%
b715
 
5.9%
r715
 
5.9%
Common
ValueCountFrequency (%)
+1504
58.4%
-949
36.9%
.121
 
4.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII14700
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
A6676
45.4%
B4020
27.3%
+1504
 
10.2%
-949
 
6.5%
b715
 
4.9%
r715
 
4.9%
.121
 
0.8%

agencia
Categorical

HIGH CORRELATION
MISSING

Distinct8
Distinct (%)0.2%
Missing7629
Missing (%)60.8%
Memory size98.1 KiB
Fitch
2992 
S&P
1359 
Moodys
 
170
FITCH
 
164
Moody´s
 
121
Other values (3)
 
105

Length

Max length8
Median length5
Mean length4.580737121
Min length3

Characters and Unicode

Total characters22496
Distinct characters28
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowS&P
2nd rowS&P
3rd rowS&P
4th rowMoody´s
5th rowS&P

Common Values

ValueCountFrequency (%)
Fitch2992
 
23.9%
S&P1359
 
10.8%
Moodys170
 
1.4%
FITCH164
 
1.3%
Moody´s121
 
1.0%
LFRating71
 
0.6%
MOODYS21
 
0.2%
Austin13
 
0.1%
(Missing)7629
60.8%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
fitch3156
64.3%
s&p1359
27.7%
moodys191
 
3.9%
moody´s121
 
2.5%
lfrating71
 
1.4%
austin13
 
0.3%

Most occurring characters

ValueCountFrequency (%)
F3227
14.3%
t3076
13.7%
i3076
13.7%
c2992
13.3%
h2992
13.3%
S1380
6.1%
&1359
6.0%
P1359
6.0%
o582
 
2.6%
M312
 
1.4%
Other values (18)2141
9.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter13843
61.5%
Uppercase Letter7173
31.9%
Other Punctuation1359
 
6.0%
Modifier Symbol121
 
0.5%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
F3227
45.0%
S1380
19.2%
P1359
18.9%
M312
 
4.3%
C164
 
2.3%
H164
 
2.3%
T164
 
2.3%
I164
 
2.3%
L71
 
1.0%
R71
 
1.0%
Other values (4)97
 
1.4%
Lowercase Letter
ValueCountFrequency (%)
t3076
22.2%
i3076
22.2%
c2992
21.6%
h2992
21.6%
o582
 
4.2%
s304
 
2.2%
d291
 
2.1%
y291
 
2.1%
n84
 
0.6%
a71
 
0.5%
Other values (2)84
 
0.6%
Other Punctuation
ValueCountFrequency (%)
&1359
100.0%
Modifier Symbol
ValueCountFrequency (%)
´121
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin21016
93.4%
Common1480
 
6.6%

Most frequent character per script

Latin
ValueCountFrequency (%)
F3227
15.4%
t3076
14.6%
i3076
14.6%
c2992
14.2%
h2992
14.2%
S1380
6.6%
P1359
6.5%
o582
 
2.8%
M312
 
1.5%
s304
 
1.4%
Other values (16)1716
8.2%
Common
ValueCountFrequency (%)
&1359
91.8%
´121
 
8.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII22375
99.5%
None121
 
0.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
F3227
14.4%
t3076
13.7%
i3076
13.7%
c2992
13.4%
h2992
13.4%
S1380
6.2%
&1359
6.1%
P1359
6.1%
o582
 
2.6%
M312
 
1.4%
Other values (17)2020
9.0%
None
ValueCountFrequency (%)
´121
100.0%

preco
Real number (ℝ≥0)

HIGH CORRELATION

Distinct2767
Distinct (%)22.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11011.19999
Minimum0.94
Maximum1000000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size98.1 KiB

Quantile statistics

Minimum0.94
5-th percentile180.049
Q11036.9575
median10000
Q310000
95-th percentile10000
Maximum1000000
Range999999.06
Interquartile range (IQR)8963.0425

Descriptive statistics

Standard deviation56312.14819
Coefficient of variation (CV)5.114079141
Kurtosis216.6178903
Mean11011.19999
Median Absolute Deviation (MAD)5000
Skewness14.1703607
Sum138080447.9
Variance3171058033
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
100006115
48.8%
10001757
 
14.0%
5000476
 
3.8%
20000221
 
1.8%
50000115
 
0.9%
500100
 
0.8%
3000064
 
0.5%
163
 
0.5%
5044
 
0.4%
10044
 
0.4%
Other values (2757)3541
28.2%
ValueCountFrequency (%)
0.942
< 0.1%
0.961
< 0.1%
0.960382671
< 0.1%
0.975395531
< 0.1%
0.976224411
< 0.1%
0.979631881
< 0.1%
0.982067631
< 0.1%
0.98368341
< 0.1%
0.986017031
< 0.1%
0.986529551
< 0.1%
ValueCountFrequency (%)
100000021
0.2%
75000021
0.2%
50000021
0.2%
392178.56321
 
< 0.1%
342895.70461
 
< 0.1%
250976.8971
 
< 0.1%
25000026
0.2%
209539.52831
 
< 0.1%
157803.79841
 
< 0.1%
1000005
 
< 0.1%

corretora
Categorical

HIGH CORRELATION

Distinct24
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size98.1 KiB
Banco Safra
6967 
XP
1367 
Rico
837 
Nuinvest
 
669
Nova Futura
 
533
Other values (19)
2167 

Length

Max length20
Median length11
Mean length9.576315789
Min length2

Characters and Unicode

Total characters120087
Distinct characters38
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowHurst Capital
2nd rowHurst Capital
3rd rowHurst Capital
4th rowHurst Capital
5th rowHurst Capital

Common Values

ValueCountFrequency (%)
Banco Safra6967
55.6%
XP1367
 
10.9%
Rico837
 
6.7%
Nuinvest669
 
5.3%
Nova Futura533
 
4.3%
Ágora514
 
4.1%
Ativa Investimentos375
 
3.0%
modalmais314
 
2.5%
Terra Investimentos275
 
2.2%
Banco Daycoval185
 
1.5%
Other values (14)504
 
4.0%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
banco7368
34.5%
safra6967
32.6%
xp1367
 
6.4%
rico837
 
3.9%
nuinvest669
 
3.1%
investimentos656
 
3.1%
nova533
 
2.5%
futura533
 
2.5%
ágora514
 
2.4%
ativa375
 
1.8%
Other values (24)1533
 
7.2%

Most occurring characters

ValueCountFrequency (%)
a25200
21.0%
o10473
8.7%
n9613
 
8.0%
r8902
 
7.4%
8812
 
7.3%
c8490
 
7.1%
B7475
 
6.2%
S6990
 
5.8%
f6990
 
5.8%
i3290
 
2.7%
Other values (28)23852
19.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter88762
73.9%
Uppercase Letter22513
 
18.7%
Space Separator8812
 
7.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a25200
28.4%
o10473
11.8%
n9613
 
10.8%
r8902
 
10.0%
c8490
 
9.6%
f6990
 
7.9%
i3290
 
3.7%
t3276
 
3.7%
e2515
 
2.8%
v2462
 
2.8%
Other values (9)7551
 
8.5%
Uppercase Letter
ValueCountFrequency (%)
B7475
33.2%
S6990
31.0%
P1521
 
6.8%
X1367
 
6.1%
N1202
 
5.3%
R843
 
3.7%
I799
 
3.5%
F557
 
2.5%
Á514
 
2.3%
A399
 
1.8%
Other values (8)846
 
3.8%
Space Separator
ValueCountFrequency (%)
8812
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin111275
92.7%
Common8812
 
7.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
a25200
22.6%
o10473
9.4%
n9613
 
8.6%
r8902
 
8.0%
c8490
 
7.6%
B7475
 
6.7%
S6990
 
6.3%
f6990
 
6.3%
i3290
 
3.0%
t3276
 
2.9%
Other values (27)20576
18.5%
Common
ValueCountFrequency (%)
8812
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII119440
99.5%
None647
 
0.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a25200
21.1%
o10473
8.8%
n9613
 
8.0%
r8902
 
7.5%
8812
 
7.4%
c8490
 
7.1%
B7475
 
6.3%
S6990
 
5.9%
f6990
 
5.9%
i3290
 
2.8%
Other values (25)23205
19.4%
None
ValueCountFrequency (%)
Á514
79.4%
Ó123
 
19.0%
ú10
 
1.5%

nr
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct14
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean17.36275917
Minimum0
Maximum25
Zeros2426
Zeros (%)19.3%
Negative0
Negative (%)0.0%
Memory size98.1 KiB

Quantile statistics

Minimum0
5-th percentile0
Q112
median23
Q325
95-th percentile25
Maximum25
Range25
Interquartile range (IQR)13

Descriptive statistics

Standard deviation9.483136794
Coefficient of variation (CV)0.5461768317
Kurtosis-0.6253925903
Mean17.36275917
Median Absolute Deviation (MAD)2
Skewness-0.9626859936
Sum217729
Variance89.92988345
MonotonicityNot monotonic
Histogram with fixed size bins (bins=14)
ValueCountFrequency (%)
254940
39.4%
02426
19.3%
121220
 
9.7%
23920
 
7.3%
19779
 
6.2%
24738
 
5.9%
15514
 
4.1%
16386
 
3.1%
20181
 
1.4%
21160
 
1.3%
Other values (4)276
 
2.2%
ValueCountFrequency (%)
02426
19.3%
121220
9.7%
1326
 
0.2%
15514
 
4.1%
16386
 
3.1%
17108
 
0.9%
1862
 
0.5%
19779
 
6.2%
20181
 
1.4%
21160
 
1.3%
ValueCountFrequency (%)
254940
39.4%
24738
 
5.9%
23920
 
7.3%
2280
 
0.6%
21160
 
1.3%
20181
 
1.4%
19779
 
6.2%
1862
 
0.5%
17108
 
0.9%
16386
 
3.1%

a
Categorical

HIGH CORRELATION

Distinct24
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size98.1 KiB
Safra
6967 
XP
1367 
Rico
837 
Nuinvest
 
669
Nova Futura
 
533
Other values (19)
2167 

Length

Max length19
Median length5
Mean length6.236124402
Min length2

Characters and Unicode

Total characters78201
Distinct characters37
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowHurst Capital
2nd rowHurst Capital
3rd rowHurst Capital
4th rowHurst Capital
5th rowHurst Capital

Common Values

ValueCountFrequency (%)
Safra6967
55.6%
XP1367
 
10.9%
Rico837
 
6.7%
Nuinvest669
 
5.3%
Nova Futura533
 
4.3%
Ágora514
 
4.1%
Ativa Investimentos375
 
3.0%
modalmais314
 
2.5%
Terra Investimentos275
 
2.2%
Banco Daycoval185
 
1.5%
Other values (14)504
 
4.0%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
safra6967
48.5%
xp1367
 
9.5%
rico837
 
5.8%
nuinvest669
 
4.7%
investimentos650
 
4.5%
nova533
 
3.7%
futura533
 
3.7%
ágora514
 
3.6%
banco401
 
2.8%
ativa375
 
2.6%
Other values (24)1533
 
10.7%

Most occurring characters

ValueCountFrequency (%)
a18233
23.3%
r8902
11.4%
S6990
 
8.9%
f6990
 
8.9%
o3500
 
4.5%
i3284
 
4.2%
t3264
 
4.2%
n2634
 
3.4%
e2503
 
3.2%
v2456
 
3.1%
Other values (27)19445
24.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter60822
77.8%
Uppercase Letter15540
 
19.9%
Space Separator1839
 
2.4%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a18233
30.0%
r8902
14.6%
f6990
 
11.5%
o3500
 
5.8%
i3284
 
5.4%
t3264
 
5.4%
n2634
 
4.3%
e2503
 
4.1%
v2456
 
4.0%
s2365
 
3.9%
Other values (9)6691
 
11.0%
Uppercase Letter
ValueCountFrequency (%)
S6990
45.0%
P1521
 
9.8%
X1367
 
8.8%
N1202
 
7.7%
R843
 
5.4%
I793
 
5.1%
F557
 
3.6%
Á514
 
3.3%
B508
 
3.3%
A399
 
2.6%
Other values (7)846
 
5.4%
Space Separator
ValueCountFrequency (%)
1839
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin76362
97.6%
Common1839
 
2.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
a18233
23.9%
r8902
11.7%
S6990
 
9.2%
f6990
 
9.2%
o3500
 
4.6%
i3284
 
4.3%
t3264
 
4.3%
n2634
 
3.4%
e2503
 
3.3%
v2456
 
3.2%
Other values (26)17606
23.1%
Common
ValueCountFrequency (%)
1839
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII77677
99.3%
None524
 
0.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a18233
23.5%
r8902
11.5%
S6990
 
9.0%
f6990
 
9.0%
o3500
 
4.5%
i3284
 
4.2%
t3264
 
4.2%
n2634
 
3.4%
e2503
 
3.2%
v2456
 
3.2%
Other values (25)18921
24.4%
None
ValueCountFrequency (%)
Á514
98.1%
ú10
 
1.9%

investir
Boolean

HIGH CORRELATION

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size12.4 KiB
False
12417 
True
 
123
ValueCountFrequency (%)
False12417
99.0%
True123
 
1.0%

tp_d
Categorical

HIGH CORRELATION
MISSING

Distinct6
Distinct (%)< 0.1%
Missing514
Missing (%)4.1%
Memory size98.1 KiB
Banco
7302 
Corretora
3313 
Distribuidora
 
712
Nuinvest
 
669
Financeira
 
24

Length

Max length13
Median length5
Mean length6.753450856
Min length5

Characters and Unicode

Total characters81217
Distinct characters19
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowFintech
2nd rowFintech
3rd rowFintech
4th rowFintech
5th rowFintech

Common Values

ValueCountFrequency (%)
Banco7302
58.2%
Corretora3313
26.4%
Distribuidora712
 
5.7%
Nuinvest669
 
5.3%
Financeira24
 
0.2%
Fintech6
 
< 0.1%
(Missing)514
 
4.1%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
banco7302
60.7%
corretora3313
27.5%
distribuidora712
 
5.9%
nuinvest669
 
5.6%
financeira24
 
0.2%
fintech6
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
o14640
18.0%
r11387
14.0%
a11375
14.0%
n8025
9.9%
c7332
9.0%
B7302
9.0%
t4700
 
5.8%
e4012
 
4.9%
C3313
 
4.1%
i2859
 
3.5%
Other values (9)6272
7.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter69191
85.2%
Uppercase Letter12026
 
14.8%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
o14640
21.2%
r11387
16.5%
a11375
16.4%
n8025
11.6%
c7332
10.6%
t4700
 
6.8%
e4012
 
5.8%
i2859
 
4.1%
u1381
 
2.0%
s1381
 
2.0%
Other values (4)2099
 
3.0%
Uppercase Letter
ValueCountFrequency (%)
B7302
60.7%
C3313
27.5%
D712
 
5.9%
N669
 
5.6%
F30
 
0.2%

Most occurring scripts

ValueCountFrequency (%)
Latin81217
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
o14640
18.0%
r11387
14.0%
a11375
14.0%
n8025
9.9%
c7332
9.0%
B7302
9.0%
t4700
 
5.8%
e4012
 
4.9%
C3313
 
4.1%
i2859
 
3.5%
Other values (9)6272
7.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII81217
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
o14640
18.0%
r11387
14.0%
a11375
14.0%
n8025
9.9%
c7332
9.0%
B7302
9.0%
t4700
 
5.8%
e4012
 
4.9%
C3313
 
4.1%
i2859
 
3.5%
Other values (9)6272
7.7%

chat
Boolean

HIGH CORRELATION
MISSING

Distinct2
Distinct (%)< 0.1%
Missing304
Missing (%)2.4%
Memory size98.1 KiB
False
11383 
True
 
853
(Missing)
 
304
ValueCountFrequency (%)
False11383
90.8%
True853
 
6.8%
(Missing)304
 
2.4%

enable_popup
Boolean

CONSTANT
MISSING
REJECTED

Distinct1
Distinct (%)8.3%
Missing12528
Missing (%)99.9%
Memory size98.1 KiB
True
 
12
(Missing)
12528 
ValueCountFrequency (%)
True12
 
0.1%
(Missing)12528
99.9%

url
Categorical

HIGH CORRELATION
MISSING

Distinct14
Distinct (%)5.1%
Missing12265
Missing (%)97.8%
Memory size98.1 KiB
https://cadastro.ativainvestimentos.com.br/?utm_source=apprendafixa&utm_medium=email&utm_campaign=debenture_2508
252 
https://bit.ly/3gcKY8b
 
11
https://hurst.capital/operation/960dfe3e-e7a3-4d77-bc61-f5dcf40cdb12
 
1
https://hurst.capital/operation/0e0349ff-81f5-4a71-ace1-ea50c5aa9731
 
1
https://hurst.capital/operation/97294719-f536-4c72-8a6d-3fb12841f10f
 
1
Other values (9)
 
9

Length

Max length112
Median length112
Mean length105.8909091
Min length22

Characters and Unicode

Total characters29120
Distinct characters43
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique12 ?
Unique (%)4.4%

Sample

1st rowhttps://hurst.capital/operation/960dfe3e-e7a3-4d77-bc61-f5dcf40cdb12
2nd rowhttps://hurst.capital/operation/0e0349ff-81f5-4a71-ace1-ea50c5aa9731
3rd rowhttps://hurst.capital/operation/97294719-f536-4c72-8a6d-3fb12841f10f
4th rowhttps://hurst.capital/operation/3550baf5-a7a1-4b4e-91f0-158ddda60971
5th rowhttps://hurst.capital/operation/60350e38-8cb4-4fa3-83d3-25cd18491474

Common Values

ValueCountFrequency (%)
https://cadastro.ativainvestimentos.com.br/?utm_source=apprendafixa&utm_medium=email&utm_campaign=debenture_2508252
 
2.0%
https://bit.ly/3gcKY8b11
 
0.1%
https://hurst.capital/operation/960dfe3e-e7a3-4d77-bc61-f5dcf40cdb121
 
< 0.1%
https://hurst.capital/operation/0e0349ff-81f5-4a71-ace1-ea50c5aa97311
 
< 0.1%
https://hurst.capital/operation/97294719-f536-4c72-8a6d-3fb12841f10f1
 
< 0.1%
https://hurst.capital/operation/3550baf5-a7a1-4b4e-91f0-158ddda609711
 
< 0.1%
https://hurst.capital/operation/60350e38-8cb4-4fa3-83d3-25cd184914741
 
< 0.1%
https://hurst.capital/operation/73620874-ecac-431d-862a-8f542c225b091
 
< 0.1%
https://bancorci.onelink.me/J1lr/a944407e1
 
< 0.1%
https://bancorci.onelink.me/J1lr/a5dbb6eb1
 
< 0.1%
Other values (4)4
 
< 0.1%
(Missing)12265
97.8%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
https://cadastro.ativainvestimentos.com.br/?utm_source=apprendafixa&utm_medium=email&utm_campaign=debenture_2508252
91.6%
https://bit.ly/3gcky8b11
 
4.0%
https://hurst.capital/operation/960dfe3e-e7a3-4d77-bc61-f5dcf40cdb121
 
0.4%
https://hurst.capital/operation/0e0349ff-81f5-4a71-ace1-ea50c5aa97311
 
0.4%
https://hurst.capital/operation/97294719-f536-4c72-8a6d-3fb12841f10f1
 
0.4%
https://hurst.capital/operation/3550baf5-a7a1-4b4e-91f0-158ddda609711
 
0.4%
https://hurst.capital/operation/60350e38-8cb4-4fa3-83d3-25cd184914741
 
0.4%
https://hurst.capital/operation/73620874-ecac-431d-862a-8f542c225b091
 
0.4%
https://bancorci.onelink.me/j1lr/a944407e1
 
0.4%
https://bancorci.onelink.me/j1lr/a5dbb6eb1
 
0.4%
Other values (4)4
 
1.5%

Most occurring characters

ValueCountFrequency (%)
t2595
 
8.9%
a2562
 
8.8%
e2300
 
7.9%
m2274
 
7.8%
i1799
 
6.2%
u1518
 
5.2%
s1289
 
4.4%
n1284
 
4.4%
r1284
 
4.4%
c1049
 
3.6%
Other values (33)11166
38.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter23458
80.6%
Other Punctuation2653
 
9.1%
Decimal Number1193
 
4.1%
Connector Punctuation1008
 
3.5%
Math Symbol756
 
2.6%
Uppercase Letter28
 
0.1%
Dash Punctuation24
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
t2595
11.1%
a2562
10.9%
e2300
9.8%
m2274
9.7%
i1799
 
7.7%
u1518
 
6.5%
s1289
 
5.5%
n1284
 
5.5%
r1284
 
5.5%
c1049
 
4.5%
Other values (12)5504
23.5%
Decimal Number
ValueCountFrequency (%)
8277
23.2%
0270
22.6%
5266
22.3%
2264
22.1%
327
 
2.3%
124
 
2.0%
421
 
1.8%
718
 
1.5%
614
 
1.2%
912
 
1.0%
Other Punctuation
ValueCountFrequency (%)
/837
31.5%
.785
29.6%
&504
19.0%
:275
 
10.4%
?252
 
9.5%
Uppercase Letter
ValueCountFrequency (%)
K11
39.3%
Y11
39.3%
J6
21.4%
Connector Punctuation
ValueCountFrequency (%)
_1008
100.0%
Math Symbol
ValueCountFrequency (%)
=756
100.0%
Dash Punctuation
ValueCountFrequency (%)
-24
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin23486
80.7%
Common5634
 
19.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
t2595
11.0%
a2562
10.9%
e2300
9.8%
m2274
9.7%
i1799
 
7.7%
u1518
 
6.5%
s1289
 
5.5%
n1284
 
5.5%
r1284
 
5.5%
c1049
 
4.5%
Other values (15)5532
23.6%
Common
ValueCountFrequency (%)
_1008
17.9%
/837
14.9%
.785
13.9%
=756
13.4%
&504
8.9%
8277
 
4.9%
:275
 
4.9%
0270
 
4.8%
5266
 
4.7%
2264
 
4.7%
Other values (8)392
 
7.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII29120
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
t2595
 
8.9%
a2562
 
8.8%
e2300
 
7.9%
m2274
 
7.8%
i1799
 
6.2%
u1518
 
5.2%
s1289
 
4.4%
n1284
 
4.4%
r1284
 
4.4%
c1049
 
3.6%
Other values (33)11166
38.3%

tp_mercado
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size98.1 KiB
P
9025 
S
3515 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters12540
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowP
2nd rowP
3rd rowP
4th rowP
5th rowP

Common Values

ValueCountFrequency (%)
P9025
72.0%
S3515
 
28.0%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
p9025
72.0%
s3515
 
28.0%

Most occurring characters

ValueCountFrequency (%)
P9025
72.0%
S3515
 
28.0%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter12540
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
P9025
72.0%
S3515
 
28.0%

Most occurring scripts

ValueCountFrequency (%)
Latin12540
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
P9025
72.0%
S3515
 
28.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII12540
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
P9025
72.0%
S3515
 
28.0%

tir
Categorical

HIGH CORRELATION

Distinct5
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size98.1 KiB
15.0
4678 
0.0
3441 
17.5
2135 
20.0
1145 
22.5
1141 

Length

Max length4
Median length4
Mean length3.725598086
Min length3

Characters and Unicode

Total characters46719
Distinct characters6
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0.0
2nd row0.0
3rd row17.5
4th row0.0
5th row15.0

Common Values

ValueCountFrequency (%)
15.04678
37.3%
0.03441
27.4%
17.52135
17.0%
20.01145
 
9.1%
22.51141
 
9.1%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
15.04678
37.3%
0.03441
27.4%
17.52135
17.0%
20.01145
 
9.1%
22.51141
 
9.1%

Most occurring characters

ValueCountFrequency (%)
013850
29.6%
.12540
26.8%
57954
17.0%
16813
14.6%
23427
 
7.3%
72135
 
4.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number34179
73.2%
Other Punctuation12540
 
26.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
013850
40.5%
57954
23.3%
16813
19.9%
23427
 
10.0%
72135
 
6.2%
Other Punctuation
ValueCountFrequency (%)
.12540
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common46719
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
013850
29.6%
.12540
26.8%
57954
17.0%
16813
14.6%
23427
 
7.3%
72135
 
4.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII46719
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
013850
29.6%
.12540
26.8%
57954
17.0%
16813
14.6%
23427
 
7.3%
72135
 
4.6%

vir
Real number (ℝ≥0)

HIGH CORRELATION
SKEWED
ZEROS

Distinct5408
Distinct (%)43.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean327.3002671
Minimum0
Maximum54299.22
Zeros3446
Zeros (%)27.5%
Negative0
Negative (%)0.0%
Memory size98.1 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median57.155
Q3352.2525
95-th percentile1170.674
Maximum54299.22
Range54299.22
Interquartile range (IQR)352.2525

Descriptive statistics

Standard deviation1325.667188
Coefficient of variation (CV)4.050308911
Kurtosis645.3655006
Mean327.3002671
Median Absolute Deviation (MAD)57.155
Skewness22.0592276
Sum4104345.35
Variance1757393.494
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
03446
27.5%
0.0681
 
0.6%
0.0580
 
0.6%
253.1270
 
0.6%
0.0370
 
0.6%
0.0450
 
0.4%
0.0839
 
0.3%
0.0236
 
0.3%
0.0131
 
0.2%
255.6730
 
0.2%
Other values (5398)8607
68.6%
ValueCountFrequency (%)
03446
27.5%
0.0131
 
0.2%
0.0236
 
0.3%
0.0370
 
0.6%
0.0450
 
0.4%
0.0580
 
0.6%
0.0681
 
0.6%
0.0721
 
0.2%
0.0839
 
0.3%
0.0912
 
0.1%
ValueCountFrequency (%)
54299.221
< 0.1%
45791.131
< 0.1%
42861.491
< 0.1%
40436.11
< 0.1%
34158.981
< 0.1%
31964.51
< 0.1%
31423.981
< 0.1%
30028.511
< 0.1%
26765.551
< 0.1%
24630.81
< 0.1%

dc
Real number (ℝ≥0)

HIGH CORRELATION

Distinct1797
Distinct (%)14.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean978.7117225
Minimum30
Maximum8765
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size98.1 KiB

Quantile statistics

Minimum30
5-th percentile90
Q1365
median730
Q31381.25
95-th percentile2589.05
Maximum8765
Range8735
Interquartile range (IQR)1016.25

Descriptive statistics

Standard deviation922.3521654
Coefficient of variation (CV)0.9424145479
Kurtosis10.21786063
Mean978.7117225
Median Absolute Deviation (MAD)370
Skewness2.439257558
Sum12273045
Variance850733.5171
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
365943
 
7.5%
1096450
 
3.6%
181448
 
3.6%
730414
 
3.3%
721330
 
2.6%
731322
 
2.6%
90310
 
2.5%
1095265
 
2.1%
1080239
 
1.9%
1461238
 
1.9%
Other values (1787)8581
68.4%
ValueCountFrequency (%)
3069
0.6%
3130
 
0.2%
3231
 
0.2%
331
 
< 0.1%
353
 
< 0.1%
401
 
< 0.1%
481
 
< 0.1%
60101
0.8%
6134
 
0.3%
6218
 
0.1%
ValueCountFrequency (%)
87651
< 0.1%
86131
< 0.1%
85222
< 0.1%
79491
< 0.1%
79481
< 0.1%
79441
< 0.1%
79431
< 0.1%
79421
< 0.1%
79401
< 0.1%
79371
< 0.1%

du
Real number (ℝ≥0)

HIGH CORRELATION

Distinct1549
Distinct (%)12.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean672.273764
Minimum20
Maximum6028
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size98.1 KiB

Quantile statistics

Minimum20
5-th percentile61
Q1251
median501
Q3949
95-th percentile1779.05
Maximum6028
Range6008
Interquartile range (IQR)698

Descriptive statistics

Standard deviation634.4138864
Coefficient of variation (CV)0.943683839
Kurtosis10.21313161
Mean672.273764
Median Absolute Deviation (MAD)255
Skewness2.438516713
Sum8430313
Variance402480.9793
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2511095
 
8.7%
501521
 
4.2%
124397
 
3.2%
502344
 
2.7%
753288
 
2.3%
752288
 
2.3%
61286
 
2.3%
754282
 
2.2%
742272
 
2.2%
1005238
 
1.9%
Other values (1539)8529
68.0%
ValueCountFrequency (%)
2035
0.3%
2175
0.6%
2216
 
0.1%
235
 
< 0.1%
243
 
< 0.1%
271
 
< 0.1%
321
 
< 0.1%
4068
0.5%
4130
 
0.2%
4250
0.4%
ValueCountFrequency (%)
60281
< 0.1%
59211
< 0.1%
58592
< 0.1%
54671
< 0.1%
54661
< 0.1%
54641
< 0.1%
54631
< 0.1%
54621
< 0.1%
54601
< 0.1%
54591
< 0.1%

rbd
Real number (ℝ≥0)

HIGH CORRELATION

Distinct1634
Distinct (%)13.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.05284976874
Minimum0
Maximum0.120032
Zeros15
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size98.1 KiB

Quantile statistics

Minimum0
5-th percentile0.0422772
Q10.048932
median0.053269
Q30.05739
95-th percentile0.067423
Maximum0.120032
Range0.120032
Interquartile range (IQR)0.008458

Descriptive statistics

Standard deviation0.00931271235
Coefficient of variation (CV)0.1762110331
Kurtosis13.65684801
Mean0.05284976874
Median Absolute Deviation (MAD)0.004121
Skewness-2.362668719
Sum662.7361
Variance8.672661132 × 10-5
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.067423586
 
4.7%
0.055476563
 
4.5%
0.058406277
 
2.2%
0.050788262
 
2.1%
0.055359214
 
1.7%
0.053835205
 
1.6%
0.05282191
 
1.5%
0.056883184
 
1.5%
0.051804160
 
1.3%
0.054851153
 
1.2%
Other values (1624)9745
77.7%
ValueCountFrequency (%)
015
0.1%
1 × 10-51
 
< 0.1%
3.6 × 10-53
 
< 0.1%
6.1 × 10-52
 
< 0.1%
0.0001371
 
< 0.1%
0.0001633
 
< 0.1%
0.0002696
 
< 0.1%
0.0002741
 
< 0.1%
0.0002791
 
< 0.1%
0.0002899
0.1%
ValueCountFrequency (%)
0.1200321
 
< 0.1%
0.1196161
 
< 0.1%
0.1124961
 
< 0.1%
0.1081791
 
< 0.1%
0.0959891
 
< 0.1%
0.0830891
 
< 0.1%
0.0744211
 
< 0.1%
0.0742751
 
< 0.1%
0.0728514
0.1%
0.0727791
 
< 0.1%

rbm
Real number (ℝ≥0)

HIGH CORRELATION

Distinct1534
Distinct (%)12.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.115916722
Minimum0
Maximum2.5512
Zeros15
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size98.1 KiB

Quantile statistics

Minimum0
5-th percentile0.891565
Q11.0326
median1.12465
Q31.2121
95-th percentile1.4255
Maximum2.5512
Range2.5512
Interquartile range (IQR)0.1795

Descriptive statistics

Standard deviation0.1972144266
Coefficient of variation (CV)0.1767286237
Kurtosis13.51557058
Mean1.115916722
Median Absolute Deviation (MAD)0.08745
Skewness-2.334444068
Sum13993.5957
Variance0.03889353005
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1.4255586
 
4.7%
1.1715566
 
4.5%
1.2337277
 
2.2%
1.072262
 
2.1%
1.169214
 
1.7%
1.1366205
 
1.6%
1.1151191
 
1.5%
1.2014184
 
1.5%
1.0935169
 
1.3%
1.1582153
 
1.2%
Other values (1524)9733
77.6%
ValueCountFrequency (%)
015
0.1%
0.00021
 
< 0.1%
0.00073
 
< 0.1%
0.00132
 
< 0.1%
0.00291
 
< 0.1%
0.00343
 
< 0.1%
0.00576
 
< 0.1%
0.00581
 
< 0.1%
0.00591
 
< 0.1%
0.00619
0.1%
ValueCountFrequency (%)
2.55121
 
< 0.1%
2.54221
 
< 0.1%
2.38921
 
< 0.1%
2.29651
 
< 0.1%
2.03521
 
< 0.1%
1.75941
 
< 0.1%
1.57451
 
< 0.1%
1.57141
 
< 0.1%
1.541114
0.1%
1.53951
 
< 0.1%

rba
Real number (ℝ≥0)

HIGH CORRELATION

Distinct921
Distinct (%)7.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean14.27248405
Minimum0
Maximum35.3
Zeros16
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size98.1 KiB

Quantile statistics

Minimum0
5-th percentile11.2395
Q113.12
median14.36
Q315.56
95-th percentile18.51
Maximum35.3
Range35.3
Interquartile range (IQR)2.44

Descriptive statistics

Standard deviation2.611150202
Coefficient of variation (CV)0.1829499471
Kurtosis12.05153723
Mean14.27248405
Median Absolute Deviation (MAD)1.2
Skewness-2.012059401
Sum178976.95
Variance6.818105377
MonotonicityDecreasing
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
18.51587
 
4.7%
15570
 
4.5%
15.85317
 
2.5%
13.65263
 
2.1%
14.97216
 
1.7%
14.53206
 
1.6%
15.41196
 
1.6%
14.23195
 
1.6%
14.82171
 
1.4%
13.94170
 
1.4%
Other values (911)9649
76.9%
ValueCountFrequency (%)
016
0.1%
0.013
 
< 0.1%
0.022
 
< 0.1%
0.031
 
< 0.1%
0.043
 
< 0.1%
0.0718
0.1%
0.083
 
< 0.1%
0.092
 
< 0.1%
0.118
0.1%
0.1120
0.2%
ValueCountFrequency (%)
35.31
 
< 0.1%
35.161
 
< 0.1%
32.751
 
< 0.1%
31.321
 
< 0.1%
27.351
 
< 0.1%
23.281
 
< 0.1%
20.621
 
< 0.1%
20.581
 
< 0.1%
20.1414
0.1%
20.121
 
< 0.1%

rbp
Real number (ℝ≥0)

HIGH CORRELATION

Distinct5460
Distinct (%)43.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean59.12558134
Minimum0
Maximum3184.75
Zeros19
Zeros (%)0.2%
Negative0
Negative (%)0.0%
Memory size98.1 KiB

Quantile statistics

Minimum0
5-th percentile2.86
Q113.16
median30.2
Q359.675
95-th percentile178.9045
Maximum3184.75
Range3184.75
Interquartile range (IQR)46.515

Descriptive statistics

Standard deviation141.9712207
Coefficient of variation (CV)2.40118097
Kurtosis201.3832365
Mean59.12558134
Median Absolute Deviation (MAD)20.855
Skewness12.14136769
Sum741434.79
Variance20155.82752
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
14.4682
 
0.7%
14.3243
 
0.3%
14.1739
 
0.3%
14.6137
 
0.3%
3.1837
 
0.3%
6.536
 
0.3%
3.2135
 
0.3%
32.2933
 
0.3%
12.4431
 
0.2%
6.6428
 
0.2%
Other values (5450)12139
96.8%
ValueCountFrequency (%)
019
0.2%
0.014
 
< 0.1%
0.022
 
< 0.1%
0.031
 
< 0.1%
0.041
 
< 0.1%
0.061
 
< 0.1%
0.071
 
< 0.1%
0.081
 
< 0.1%
0.091
 
< 0.1%
0.114
 
< 0.1%
ValueCountFrequency (%)
3184.751
< 0.1%
3025.682
< 0.1%
2950.361
< 0.1%
2820.331
< 0.1%
2691.381
< 0.1%
2666.351
< 0.1%
2636.541
< 0.1%
2633.221
< 0.1%
2631.561
< 0.1%
2629.911
< 0.1%

rlp
Real number (ℝ≥0)

HIGH CORRELATION

Distinct5219
Distinct (%)41.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean54.89699841
Minimum0
Maximum3184.75
Zeros19
Zeros (%)0.2%
Negative0
Negative (%)0.0%
Memory size98.1 KiB

Quantile statistics

Minimum0
5-th percentile2.43
Q111.31
median26.45
Q351.7725
95-th percentile172.5055
Maximum3184.75
Range3184.75
Interquartile range (IQR)40.4625

Descriptive statistics

Standard deviation141.7093711
Coefficient of variation (CV)2.581368294
Kurtosis204.3474148
Mean54.89699841
Median Absolute Deviation (MAD)18.31
Skewness12.28668235
Sum688408.36
Variance20081.54586
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
11.9383
 
0.7%
11.8145
 
0.4%
11.6944
 
0.4%
2.4939
 
0.3%
12.0538
 
0.3%
5.236
 
0.3%
2.4635
 
0.3%
27.4533
 
0.3%
5.3132
 
0.3%
12.4431
 
0.2%
Other values (5209)12124
96.7%
ValueCountFrequency (%)
019
0.2%
0.014
 
< 0.1%
0.022
 
< 0.1%
0.032
 
< 0.1%
0.051
 
< 0.1%
0.062
 
< 0.1%
0.071
 
< 0.1%
0.094
 
< 0.1%
0.123
 
< 0.1%
0.141
 
< 0.1%
ValueCountFrequency (%)
3184.751
< 0.1%
3025.682
< 0.1%
2950.361
< 0.1%
2820.331
< 0.1%
2691.381
< 0.1%
2666.351
< 0.1%
2636.541
< 0.1%
2633.221
< 0.1%
2631.561
< 0.1%
2629.911
< 0.1%

rld
Real number (ℝ≥0)

HIGH CORRELATION

Distinct5072
Distinct (%)40.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.04680759027
Minimum0
Maximum0.120032
Zeros15
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size98.1 KiB

Quantile statistics

Minimum0
5-th percentile0.038834
Q10.042248
median0.046208
Q30.0524625
95-th percentile0.05877
Maximum0.120032
Range0.120032
Interquartile range (IQR)0.0102145

Descriptive statistics

Standard deviation0.008629389881
Coefficient of variation (CV)0.1843587724
Kurtosis11.50224594
Mean0.04680759027
Median Absolute Deviation (MAD)0.004711
Skewness-1.89163215
Sum586.967182
Variance7.446636972 × 10-5
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.047233150
 
1.2%
0.046725137
 
1.1%
0.04824994
 
0.7%
0.04570992
 
0.7%
0.04492282
 
0.7%
0.04926481
 
0.6%
0.04774170
 
0.6%
0.04621752
 
0.4%
0.04418651
 
0.4%
0.04520144
 
0.4%
Other values (5062)11687
93.2%
ValueCountFrequency (%)
015
0.1%
8 × 10-61
 
< 0.1%
2.8 × 10-53
 
< 0.1%
4.9 × 10-52
 
< 0.1%
0.0001131
 
< 0.1%
0.0001262
 
< 0.1%
0.000131
 
< 0.1%
0.0002296
 
< 0.1%
0.0002331
 
< 0.1%
0.0002371
 
< 0.1%
ValueCountFrequency (%)
0.1200321
< 0.1%
0.1196161
< 0.1%
0.1081791
< 0.1%
0.0963881
< 0.1%
0.0842221
< 0.1%
0.0702891
< 0.1%
0.0667431
< 0.1%
0.0666941
< 0.1%
0.0663611
< 0.1%
0.0653121
< 0.1%

rlm
Real number (ℝ≥0)

HIGH CORRELATION

Distinct87
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.9878229665
Minimum0
Maximum2.55
Zeros33
Zeros (%)0.3%
Negative0
Negative (%)0.0%
Memory size98.1 KiB

Quantile statistics

Minimum0
5-th percentile0.82
Q10.89
median0.97
Q31.11
95-th percentile1.24
Maximum2.55
Range2.55
Interquartile range (IQR)0.22

Descriptive statistics

Standard deviation0.1827147208
Coefficient of variation (CV)0.184967071
Kurtosis11.36383496
Mean0.9878229665
Median Absolute Deviation (MAD)0.1
Skewness-1.862269083
Sum12387.3
Variance0.03338466919
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.95475
 
3.8%
0.91464
 
3.7%
0.94434
 
3.5%
0.93408
 
3.3%
1.2408
 
3.3%
0.98397
 
3.2%
0.85395
 
3.1%
0.99388
 
3.1%
0.86381
 
3.0%
0.84371
 
3.0%
Other values (77)8419
67.1%
ValueCountFrequency (%)
033
 
0.3%
0.01138
1.1%
0.0210
 
0.1%
0.039
 
0.1%
0.421
 
< 0.1%
0.641
 
< 0.1%
0.663
 
< 0.1%
0.672
 
< 0.1%
0.683
 
< 0.1%
0.692
 
< 0.1%
ValueCountFrequency (%)
2.551
< 0.1%
2.541
< 0.1%
2.31
< 0.1%
2.041
< 0.1%
1.781
< 0.1%
1.491
< 0.1%
1.412
< 0.1%
1.41
< 0.1%
1.381
< 0.1%
1.372
< 0.1%

rla
Real number (ℝ≥0)

HIGH CORRELATION

Distinct808
Distinct (%)6.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean12.54233174
Minimum0
Maximum35.3
Zeros16
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size98.1 KiB

Quantile statistics

Minimum0
5-th percentile10.28
Q111.23
median12.35
Q314.13
95-th percentile15.96
Maximum35.3
Range35.3
Interquartile range (IQR)2.9

Descriptive statistics

Standard deviation2.399627972
Coefficient of variation (CV)0.1913223173
Kurtosis10.45712886
Mean12.54233174
Median Absolute Deviation (MAD)1.33
Skewness-1.569215557
Sum157280.84
Variance5.758214405
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
12.64159
 
1.3%
11.98147
 
1.2%
12.49143
 
1.1%
12.21134
 
1.1%
13.2198
 
0.8%
12.9396
 
0.8%
10.4590
 
0.7%
11.886
 
0.7%
12.5885
 
0.7%
10.8584
 
0.7%
Other values (798)11418
91.1%
ValueCountFrequency (%)
016
0.1%
0.015
 
< 0.1%
0.034
 
< 0.1%
0.0621
0.2%
0.072
 
< 0.1%
0.083
 
< 0.1%
0.0932
0.3%
0.12
 
< 0.1%
0.116
 
< 0.1%
0.1231
0.2%
ValueCountFrequency (%)
35.31
< 0.1%
35.161
< 0.1%
31.321
< 0.1%
27.481
< 0.1%
23.631
< 0.1%
19.371
< 0.1%
18.311
< 0.1%
18.31
< 0.1%
18.21
< 0.1%
17.881
< 0.1%

vb
Real number (ℝ≥0)

HIGH CORRELATION

Distinct8228
Distinct (%)65.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean14596.6772
Minimum1.01
Maximum1503664.86
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size98.1 KiB

Quantile statistics

Minimum1.01
5-th percentile256.5425
Q11533.63
median10321.38
Q313168.95
95-th percentile20822.3275
Maximum1503664.86
Range1503663.85
Interquartile range (IQR)11635.32

Descriptive statistics

Standard deviation70284.11083
Coefficient of variation (CV)4.815076053
Kurtosis228.3160868
Mean14596.6772
Median Absolute Deviation (MAD)6583.97
Skewness14.38400206
Sum183042332.1
Variance4939856235
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
11446.469
 
0.6%
11431.8330
 
0.2%
11460.9930
 
0.2%
11417.2727
 
0.2%
11244.0325
 
0.2%
11301.4824
 
0.2%
13228.8524
 
0.2%
10317.7723
 
0.2%
10320.9623
 
0.2%
11228.2823
 
0.2%
Other values (8218)12242
97.6%
ValueCountFrequency (%)
1.011
 
< 0.1%
1.024
< 0.1%
1.033
< 0.1%
1.072
 
< 0.1%
1.111
 
< 0.1%
1.122
 
< 0.1%
1.133
< 0.1%
1.144
< 0.1%
1.156
< 0.1%
1.162
 
< 0.1%
ValueCountFrequency (%)
1503664.861
< 0.1%
1481479.551
< 0.1%
1452062.561
< 0.1%
1430349.091
< 0.1%
1424971.581
< 0.1%
1361994.821
< 0.1%
1360042.771
< 0.1%
1305274.191
< 0.1%
1285743.261
< 0.1%
1278661.561
< 0.1%

vl
Real number (ℝ≥0)

HIGH CORRELATION

Distinct8313
Distinct (%)66.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean14269.37693
Minimum1.01
Maximum1503664.86
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size98.1 KiB

Quantile statistics

Minimum1.01
5-th percentile240.1405
Q11464.1625
median10254.455
Q312763.4075
95-th percentile20061.097
Maximum1503664.86
Range1503663.85
Interquartile range (IQR)11299.245

Descriptive statistics

Standard deviation69687.65536
Coefficient of variation (CV)4.883720971
Kurtosis229.9017004
Mean14269.37693
Median Absolute Deviation (MAD)6116.75
Skewness14.44052035
Sum178937986.7
Variance4856369309
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
11193.2869
 
0.6%
1.2533
 
0.3%
11181.2630
 
0.2%
11205.3230
 
0.2%
11169.2527
 
0.2%
11244.0325
 
0.2%
12744.5224
 
0.2%
10248.7523
 
0.2%
10167.0523
 
0.2%
10246.2723
 
0.2%
Other values (8303)12233
97.6%
ValueCountFrequency (%)
1.011
 
< 0.1%
1.026
< 0.1%
1.031
 
< 0.1%
1.051
 
< 0.1%
1.061
 
< 0.1%
1.114
< 0.1%
1.129
0.1%
1.132
 
< 0.1%
1.144
< 0.1%
1.152
 
< 0.1%
ValueCountFrequency (%)
1503664.861
< 0.1%
1481479.551
< 0.1%
1452062.561
< 0.1%
1430349.091
< 0.1%
1424971.581
< 0.1%
1360042.771
< 0.1%
1307695.61
< 0.1%
1278661.561
< 0.1%
1259483.061
< 0.1%
1242881.771
< 0.1%

vrl
Real number (ℝ≥0)

HIGH CORRELATION

Distinct8173
Distinct (%)65.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3258.176935
Minimum0
Maximum503664.86
Zeros16
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size98.1 KiB

Quantile statistics

Minimum0
5-th percentile8.49
Q1251.7475
median1150.155
Q33024.06
95-th percentile8057.9715
Maximum503664.86
Range503664.86
Interquartile range (IQR)2772.3125

Descriptive statistics

Standard deviation16073.61803
Coefficient of variation (CV)4.933316498
Kurtosis428.2790717
Mean3258.176935
Median Absolute Deviation (MAD)1034.605
Skewness18.83988175
Sum40857538.77
Variance258361196.5
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1193.2869
 
0.6%
1205.3230
 
0.2%
1181.2630
 
0.2%
1169.2527
 
0.2%
1244.0325
 
0.2%
2744.5224
 
0.2%
167.0523
 
0.2%
248.7523
 
0.2%
246.2723
 
0.2%
1228.2823
 
0.2%
Other values (8163)12243
97.6%
ValueCountFrequency (%)
016
0.1%
0.012
 
< 0.1%
0.0211
0.1%
0.036
 
< 0.1%
0.0411
0.1%
0.059
0.1%
0.069
0.1%
0.075
 
< 0.1%
0.085
 
< 0.1%
0.0914
0.1%
ValueCountFrequency (%)
503664.861
< 0.1%
481479.551
< 0.1%
452062.561
< 0.1%
430349.091
< 0.1%
424971.581
< 0.1%
376141.571
< 0.1%
360472.991
< 0.1%
360042.771
< 0.1%
336997.861
< 0.1%
320743.361
< 0.1%

prlt
Real number (ℝ≥0)

HIGH CORRELATION

Distinct5304
Distinct (%)42.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean54.89605981
Minimum0
Maximum3184.75
Zeros19
Zeros (%)0.2%
Negative0
Negative (%)0.0%
Memory size98.1 KiB

Quantile statistics

Minimum0
5-th percentile2.43
Q111.31
median26.45
Q351.7925
95-th percentile172.5055
Maximum3184.75
Range3184.75
Interquartile range (IQR)40.4825

Descriptive statistics

Standard deviation141.7095735
Coefficient of variation (CV)2.581416115
Kurtosis204.3465514
Mean54.89605981
Median Absolute Deviation (MAD)18.32
Skewness12.28664802
Sum688396.59
Variance20081.60321
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
11.9383
 
0.7%
11.8145
 
0.4%
11.6944
 
0.4%
5.240
 
0.3%
2.4639
 
0.3%
2.4938
 
0.3%
12.0538
 
0.3%
5.3132
 
0.3%
12.4431
 
0.2%
1.5930
 
0.2%
Other values (5294)12120
96.7%
ValueCountFrequency (%)
019
0.2%
0.014
 
< 0.1%
0.022
 
< 0.1%
0.032
 
< 0.1%
0.051
 
< 0.1%
0.062
 
< 0.1%
0.071
 
< 0.1%
0.094
 
< 0.1%
0.123
 
< 0.1%
0.141
 
< 0.1%
ValueCountFrequency (%)
3184.751
< 0.1%
3025.682
< 0.1%
2950.361
< 0.1%
2820.331
< 0.1%
2691.381
< 0.1%
2666.351
< 0.1%
2636.541
< 0.1%
2633.221
< 0.1%
2631.561
< 0.1%
2629.911
< 0.1%

idx
Categorical

HIGH CORRELATION

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size98.1 KiB
CDI
4913 
PRÉ
4326 
IPCA
3301 

Length

Max length4
Median length3
Mean length3.26323764
Min length3

Characters and Unicode

Total characters40921
Distinct characters7
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowCDI
2nd rowCDI
3rd rowCDI
4th rowCDI
5th rowCDI

Common Values

ValueCountFrequency (%)
CDI4913
39.2%
PRÉ4326
34.5%
IPCA3301
26.3%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
cdi4913
39.2%
pré4326
34.5%
ipca3301
26.3%

Most occurring characters

ValueCountFrequency (%)
C8214
20.1%
I8214
20.1%
P7627
18.6%
D4913
12.0%
R4326
10.6%
É4326
10.6%
A3301
8.1%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter40921
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
C8214
20.1%
I8214
20.1%
P7627
18.6%
D4913
12.0%
R4326
10.6%
É4326
10.6%
A3301
8.1%

Most occurring scripts

ValueCountFrequency (%)
Latin40921
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
C8214
20.1%
I8214
20.1%
P7627
18.6%
D4913
12.0%
R4326
10.6%
É4326
10.6%
A3301
8.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII36595
89.4%
None4326
 
10.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
C8214
22.4%
I8214
22.4%
P7627
20.8%
D4913
13.4%
R4326
11.8%
A3301
9.0%
None
ValueCountFrequency (%)
É4326
100.0%

rpd
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size98.1 KiB
0.0181
7773 
0.0167
4767 

Length

Max length6
Median length6
Mean length6
Min length6

Characters and Unicode

Total characters75240
Distinct characters6
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0.0181
2nd row0.0181
3rd row0.0181
4th row0.0181
5th row0.0181

Common Values

ValueCountFrequency (%)
0.01817773
62.0%
0.01674767
38.0%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
0.01817773
62.0%
0.01674767
38.0%

Most occurring characters

ValueCountFrequency (%)
025080
33.3%
120313
27.0%
.12540
16.7%
87773
 
10.3%
64767
 
6.3%
74767
 
6.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number62700
83.3%
Other Punctuation12540
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
025080
40.0%
120313
32.4%
87773
 
12.4%
64767
 
7.6%
74767
 
7.6%
Other Punctuation
ValueCountFrequency (%)
.12540
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common75240
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
025080
33.3%
120313
27.0%
.12540
16.7%
87773
 
10.3%
64767
 
6.3%
74767
 
6.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII75240
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
025080
33.3%
120313
27.0%
.12540
16.7%
87773
 
10.3%
64767
 
6.3%
74767
 
6.3%

rpp
Real number (ℝ≥0)

HIGH CORRELATION

Distinct1696
Distinct (%)13.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13.2882512
Minimum0.33
Maximum197.34
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size98.1 KiB

Quantile statistics

Minimum0.33
5-th percentile1.11
Q14.28
median9.36
Q318.005
95-th percentile37.36
Maximum197.34
Range197.01
Interquartile range (IQR)13.725

Descriptive statistics

Standard deviation14.80452544
Coefficient of variation (CV)1.11410638
Kurtosis26.77338201
Mean13.2882512
Median Absolute Deviation (MAD)5.22
Skewness3.871722036
Sum166634.67
Variance219.1739736
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4.28597
 
4.8%
4.64507
 
4.0%
2.27327
 
2.6%
9.48265
 
2.1%
8.73259
 
2.1%
9.36214
 
1.7%
8.75211
 
1.7%
1.11202
 
1.6%
13.42198
 
1.6%
14.35194
 
1.5%
Other values (1686)9566
76.3%
ValueCountFrequency (%)
0.335
 
< 0.1%
0.3554
0.4%
0.3630
0.2%
0.375
 
< 0.1%
0.3826
0.2%
0.411
 
0.1%
0.433
 
< 0.1%
0.491
 
< 0.1%
0.581
 
< 0.1%
0.6718
 
0.1%
ValueCountFrequency (%)
197.341
 
< 0.1%
191.641
 
< 0.1%
188.392
< 0.1%
167.691
 
< 0.1%
167.61
 
< 0.1%
167.551
 
< 0.1%
167.51
 
< 0.1%
167.451
 
< 0.1%
167.43
< 0.1%
149.211
 
< 0.1%

vpp
Real number (ℝ≥0)

HIGH CORRELATION

Distinct3401
Distinct (%)27.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean12110.34547
Minimum1
Maximum1146651.57
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size98.1 KiB

Quantile statistics

Minimum1
5-th percentile202.263
Q11199.01
median10104.09
Q310936.09
95-th percentile12551.22
Maximum1146651.57
Range1146650.57
Interquartile range (IQR)9737.08

Descriptive statistics

Standard deviation61505.85523
Coefficient of variation (CV)5.078786182
Kurtosis217.2144123
Mean12110.34547
Median Absolute Deviation (MAD)4123.11
Skewness14.17117693
Sum151863732.2
Variance3782970228
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10428.14491
 
3.9%
10464.19300
 
2.4%
10872.8251
 
2.0%
11340.2171
 
1.4%
10226.69158
 
1.3%
10874.61149
 
1.2%
1046.42149
 
1.2%
1094.8144
 
1.1%
11342.09140
 
1.1%
10949.94134
 
1.1%
Other values (3391)10453
83.4%
ValueCountFrequency (%)
11
 
< 0.1%
1.017
 
0.1%
1.022
 
< 0.1%
1.043
 
< 0.1%
1.054
 
< 0.1%
1.062
 
< 0.1%
1.0713
0.1%
1.086
 
< 0.1%
1.0924
0.2%
1.19
 
0.1%
ValueCountFrequency (%)
1146651.572
 
< 0.1%
1143546.515
< 0.1%
1118197.951
 
< 0.1%
1093608.962
 
< 0.1%
1093411.282
 
< 0.1%
1068980.781
 
< 0.1%
1057257.651
 
< 0.1%
1045663.081
 
< 0.1%
1045474.074
< 0.1%
1022669.121
 
< 0.1%

teq
Categorical

HIGH CARDINALITY

Distinct1703
Distinct (%)13.6%
Missing0
Missing (%)0.0%
Memory size98.1 KiB
LCI/LCA IPCA + 7.42%
 
368
LCI/LCA 12.0% PRÉ
 
227
LCI/LCA IPCA + 7.2%
 
215
LCI/LCA 97.75% CDI
 
214
LCI/LCA 11.62% PRÉ
 
180
Other values (1698)
11336 

Length

Max length20
Median length19
Mean length17.24043062
Min length9

Characters and Unicode

Total characters216195
Distinct characters24
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique559 ?
Unique (%)4.5%

Sample

1st rowLCI/LCA 194.98% CDI
2nd rowLCI/LCA 194.3% CDI
3rd rowLCI/LCA 182.74% CDI
4th rowLCI/LCA 181.05% CDI
5th rowLCI/LCA 160.65% CDI

Common Values

ValueCountFrequency (%)
LCI/LCA IPCA + 7.42%368
 
2.9%
LCI/LCA 12.0% PRÉ227
 
1.8%
LCI/LCA IPCA + 7.2%215
 
1.7%
LCI/LCA 97.75% CDI214
 
1.7%
LCI/LCA 11.62% PRÉ180
 
1.4%
LCI/LCA 12.38% PRÉ164
 
1.3%
LCI/LCA 95.2% CDI157
 
1.3%
LCI/LCA 93.5% CDI127
 
1.0%
LCI/LCA 92.65% CDI124
 
1.0%
LCI/LCA 91.8% CDI122
 
1.0%
Other values (1693)10642
84.9%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
lci/lca9136
22.4%
cdi4913
12.0%
pré4322
 
10.6%
cdb3404
 
8.3%
3275
 
8.0%
ipca3237
 
7.9%
7.42369
 
0.9%
12.0237
 
0.6%
7.2221
 
0.5%
97.75214
 
0.5%
Other values (1401)11499
28.2%

Most occurring characters

ValueCountFrequency (%)
C29826
13.8%
28287
13.1%
L18272
 
8.5%
I17286
 
8.0%
.12540
 
5.8%
%12540
 
5.8%
A12373
 
5.7%
19748
 
4.5%
/9136
 
4.2%
D8317
 
3.8%
Other values (14)57870
26.8%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter105681
48.9%
Decimal Number44736
20.7%
Other Punctuation34216
 
15.8%
Space Separator28287
 
13.1%
Math Symbol3275
 
1.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
19748
21.8%
24637
10.4%
54388
9.8%
74350
9.7%
84342
9.7%
04135
9.2%
94065
9.1%
63692
 
8.3%
43123
 
7.0%
32256
 
5.0%
Uppercase Letter
ValueCountFrequency (%)
C29826
28.2%
L18272
17.3%
I17286
16.4%
A12373
11.7%
D8317
 
7.9%
P7559
 
7.2%
É4322
 
4.1%
R4322
 
4.1%
B3404
 
3.2%
Other Punctuation
ValueCountFrequency (%)
.12540
36.6%
%12540
36.6%
/9136
26.7%
Space Separator
ValueCountFrequency (%)
28287
100.0%
Math Symbol
ValueCountFrequency (%)
+3275
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common110514
51.1%
Latin105681
48.9%

Most frequent character per script

Common
ValueCountFrequency (%)
28287
25.6%
.12540
11.3%
%12540
11.3%
19748
 
8.8%
/9136
 
8.3%
24637
 
4.2%
54388
 
4.0%
74350
 
3.9%
84342
 
3.9%
04135
 
3.7%
Other values (5)16411
14.8%
Latin
ValueCountFrequency (%)
C29826
28.2%
L18272
17.3%
I17286
16.4%
A12373
11.7%
D8317
 
7.9%
P7559
 
7.2%
É4322
 
4.1%
R4322
 
4.1%
B3404
 
3.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII211873
98.0%
None4322
 
2.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
C29826
14.1%
28287
13.4%
L18272
 
8.6%
I17286
 
8.2%
.12540
 
5.9%
%12540
 
5.9%
A12373
 
5.8%
19748
 
4.6%
/9136
 
4.3%
D8317
 
3.9%
Other values (13)53548
25.3%
None
ValueCountFrequency (%)
É4322
100.0%

tt
Real number (ℝ≥0)

HIGH CORRELATION

Distinct921
Distinct (%)7.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean14.27248405
Minimum0
Maximum35.3
Zeros16
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size98.1 KiB

Quantile statistics

Minimum0
5-th percentile11.2395
Q113.12
median14.36
Q315.56
95-th percentile18.51
Maximum35.3
Range35.3
Interquartile range (IQR)2.44

Descriptive statistics

Standard deviation2.611150202
Coefficient of variation (CV)0.1829499471
Kurtosis12.05153723
Mean14.27248405
Median Absolute Deviation (MAD)1.2
Skewness-2.012059401
Sum178976.95
Variance6.818105377
MonotonicityDecreasing
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
18.51587
 
4.7%
15570
 
4.5%
15.85317
 
2.5%
13.65263
 
2.1%
14.97216
 
1.7%
14.53206
 
1.6%
15.41196
 
1.6%
14.23195
 
1.6%
14.82171
 
1.4%
13.94170
 
1.4%
Other values (911)9649
76.9%
ValueCountFrequency (%)
016
0.1%
0.013
 
< 0.1%
0.022
 
< 0.1%
0.031
 
< 0.1%
0.043
 
< 0.1%
0.0718
0.1%
0.083
 
< 0.1%
0.092
 
< 0.1%
0.118
0.1%
0.1120
0.2%
ValueCountFrequency (%)
35.31
 
< 0.1%
35.161
 
< 0.1%
32.751
 
< 0.1%
31.321
 
< 0.1%
27.351
 
< 0.1%
23.281
 
< 0.1%
20.621
 
< 0.1%
20.581
 
< 0.1%
20.1414
0.1%
20.121
 
< 0.1%

am
Real number (ℝ≥0)

HIGH CORRELATION

Distinct104
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean183538.3812
Minimum7600
Maximum250000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size98.1 KiB

Quantile statistics

Minimum7600
5-th percentile90000
Q1160000
median190000
Q3220000
95-th percentile240000
Maximum250000
Range242400
Interquartile range (IQR)60000

Descriptive statistics

Standard deviation48441.75101
Coefficient of variation (CV)0.2639325393
Kurtosis0.4703224799
Mean183538.3812
Median Absolute Deviation (MAD)30000
Skewness-0.9262206881
Sum2301571300
Variance2346603241
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2200001734
13.8%
2400001266
10.1%
1900001195
9.5%
230000956
 
7.6%
180000863
 
6.9%
200000830
 
6.6%
170000789
 
6.3%
210000782
 
6.2%
160000751
 
6.0%
140000635
 
5.1%
Other values (94)2739
21.8%
ValueCountFrequency (%)
76001
 
< 0.1%
80002
 
< 0.1%
82001
 
< 0.1%
86001
 
< 0.1%
90002
 
< 0.1%
91002
 
< 0.1%
92004
< 0.1%
93003
< 0.1%
110004
< 0.1%
120005
< 0.1%
ValueCountFrequency (%)
250000341
 
2.7%
2400001266
10.1%
230000956
7.6%
2200001734
13.8%
210000782
6.2%
200000830
6.6%
1900001195
9.5%
180000863
6.9%
170000789
6.3%
160000751
6.0%

total
Real number (ℝ≥0)

HIGH CORRELATION

Distinct1277
Distinct (%)10.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean44.84182057
Minimum0
Maximum236.34
Zeros17
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size98.1 KiB

Quantile statistics

Minimum0
5-th percentile5.25
Q19
median13.7
Q3100.25
95-th percentile113
Maximum236.34
Range236.34
Interquartile range (IQR)91.25

Descriptive statistics

Standard deviation45.87722992
Coefficient of variation (CV)1.023090261
Kurtosis-1.545007454
Mean44.84182057
Median Absolute Deviation (MAD)7.6
Skewness0.5770091296
Sum562316.43
Variance2104.720225
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
9586
 
4.7%
15563
 
4.5%
115277
 
2.2%
100234
 
1.9%
109214
 
1.7%
106205
 
1.6%
104191
 
1.5%
112184
 
1.5%
102160
 
1.3%
108153
 
1.2%
Other values (1267)9773
77.9%
ValueCountFrequency (%)
017
0.1%
0.021
 
< 0.1%
0.041
 
< 0.1%
0.073
 
< 0.1%
0.091
 
< 0.1%
0.12
 
< 0.1%
0.122
 
< 0.1%
0.141
 
< 0.1%
0.151
 
< 0.1%
0.183
 
< 0.1%
ValueCountFrequency (%)
236.341
< 0.1%
235.521
< 0.1%
221.51
< 0.1%
2131
< 0.1%
1891
< 0.1%
163.61
< 0.1%
1351
< 0.1%
128.51
< 0.1%
128.251
< 0.1%
1281
< 0.1%

cores
Categorical

HIGH CORRELATION

Distinct24
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size98.1 KiB
['linear-gradient(135deg, #00003C 0%,#242947 100%)']
6967 
['linear-gradient(135deg, #1e2021 0%,#3c3e40 100%)']
1367 
['linear-gradient(135deg, #030441 0%,#0e0e70 100%)']
837 
['linear-gradient(270deg, #41266B 0%,#351C55 100%)']
 
669
['linear-gradient(135deg, #005e77 1%,#007298 100%)']
 
533
Other values (19)
2167 

Length

Max length64
Median length52
Mean length52.11730463
Min length51

Characters and Unicode

Total characters653551
Distinct characters35
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row['linear-gradient(135deg, #474747 0%,#000000 100%)']
2nd row['linear-gradient(135deg, #474747 0%,#000000 100%)']
3rd row['linear-gradient(135deg, #474747 0%,#000000 100%)']
4th row['linear-gradient(135deg, #474747 0%,#000000 100%)']
5th row['linear-gradient(135deg, #474747 0%,#000000 100%)']

Common Values

ValueCountFrequency (%)
['linear-gradient(135deg, #00003C 0%,#242947 100%)']6967
55.6%
['linear-gradient(135deg, #1e2021 0%,#3c3e40 100%)']1367
 
10.9%
['linear-gradient(135deg, #030441 0%,#0e0e70 100%)']837
 
6.7%
['linear-gradient(270deg, #41266B 0%,#351C55 100%)']669
 
5.3%
['linear-gradient(135deg, #005e77 1%,#007298 100%)']533
 
4.3%
['linear-gradient(135deg, #313234 1%,#4c4e54 100%)']514
 
4.1%
['linear-gradient(135deg, #0072ce 0%,#178eee 100%)']375
 
3.0%
['linear-gradient(135deg, #008187 1%,#069b8a 100%)']314
 
2.5%
['linear-gradient(135deg, #275251 0%,#327775 100%)']275
 
2.2%
['linear-gradient(135deg, #001455 0%,#052488 100%)']185
 
1.5%
Other values (14)504
 
4.0%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
10012540
24.9%
linear-gradient(135deg11871
23.6%
00003c6967
13.9%
0%,#2429476967
13.9%
1e20211367
 
2.7%
0%,#3c3e401367
 
2.7%
030441837
 
1.7%
0%,#0e0e70837
 
1.7%
linear-gradient(270deg669
 
1.3%
41266b669
 
1.3%
Other values (41)6181
12.3%

Most occurring characters

ValueCountFrequency (%)
076541
 
11.7%
e44772
 
6.9%
37738
 
5.8%
133481
 
5.1%
a25819
 
4.0%
325318
 
3.9%
d25248
 
3.9%
%25203
 
3.9%
,25203
 
3.9%
#25203
 
3.9%
Other values (25)309025
47.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter224850
34.4%
Decimal Number219193
33.5%
Other Punctuation100689
15.4%
Space Separator37738
 
5.8%
Close Punctuation25080
 
3.8%
Open Punctuation25080
 
3.8%
Dash Punctuation12540
 
1.9%
Uppercase Letter8381
 
1.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e44772
19.9%
a25819
11.5%
d25248
11.2%
r25080
11.2%
g25080
11.2%
n25080
11.2%
i25080
11.2%
t12540
 
5.6%
l12540
 
5.6%
c2483
 
1.1%
Other values (2)1128
 
0.5%
Decimal Number
ValueCountFrequency (%)
076541
34.9%
133481
15.3%
325318
 
11.6%
221055
 
9.6%
420295
 
9.3%
516719
 
7.6%
712819
 
5.8%
98309
 
3.8%
82747
 
1.3%
61909
 
0.9%
Other Punctuation
ValueCountFrequency (%)
%25203
25.0%
,25203
25.0%
#25203
25.0%
'25080
24.9%
Uppercase Letter
ValueCountFrequency (%)
C7636
91.1%
B669
 
8.0%
E76
 
0.9%
Close Punctuation
ValueCountFrequency (%)
)12540
50.0%
]12540
50.0%
Open Punctuation
ValueCountFrequency (%)
[12540
50.0%
(12540
50.0%
Space Separator
ValueCountFrequency (%)
37738
100.0%
Dash Punctuation
ValueCountFrequency (%)
-12540
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common420320
64.3%
Latin233231
35.7%

Most frequent character per script

Common
ValueCountFrequency (%)
076541
18.2%
37738
 
9.0%
133481
 
8.0%
325318
 
6.0%
%25203
 
6.0%
,25203
 
6.0%
#25203
 
6.0%
'25080
 
6.0%
221055
 
5.0%
420295
 
4.8%
Other values (10)105203
25.0%
Latin
ValueCountFrequency (%)
e44772
19.2%
a25819
11.1%
d25248
10.8%
r25080
10.8%
g25080
10.8%
n25080
10.8%
i25080
10.8%
t12540
 
5.4%
l12540
 
5.4%
C7636
 
3.3%
Other values (5)4356
 
1.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII653551
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
076541
 
11.7%
e44772
 
6.9%
37738
 
5.8%
133481
 
5.1%
a25819
 
4.0%
325318
 
3.9%
d25248
 
3.9%
%25203
 
3.9%
,25203
 
3.9%
#25203
 
3.9%
Other values (25)309025
47.3%

logo
Categorical

HIGH CORRELATION

Distinct24
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size98.1 KiB
['safra.png']
6967 
['xp.png']
1367 
['rico.png']
837 
['nuinvest.png']
 
669
['novafutura.png']
 
533
Other values (19)
2167 

Length

Max length21
Median length13
Mean length13.12503987
Min length10

Characters and Unicode

Total characters164588
Distinct characters25
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row['hurst.png']
2nd row['hurst.png']
3rd row['hurst.png']
4th row['hurst.png']
5th row['hurst.png']

Common Values

ValueCountFrequency (%)
['safra.png']6967
55.6%
['xp.png']1367
 
10.9%
['rico.png']837
 
6.7%
['nuinvest.png']669
 
5.3%
['novafutura.png']533
 
4.3%
['agora.png']514
 
4.1%
['ativa.png']375
 
3.0%
['modalmais.png']314
 
2.5%
['terra.png']275
 
2.2%
['daycoval.png']185
 
1.5%
Other values (14)504
 
4.0%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
safra.png6967
55.6%
xp.png1367
 
10.9%
rico.png837
 
6.7%
nuinvest.png669
 
5.3%
novafutura.png533
 
4.3%
agora.png514
 
4.1%
ativa.png375
 
3.0%
modalmais.png314
 
2.5%
terra.png275
 
2.2%
daycoval.png185
 
1.5%
Other values (14)504
 
4.0%

Most occurring characters

ValueCountFrequency (%)
'25080
15.2%
a18442
11.2%
n14642
8.9%
p13985
8.5%
g13170
8.0%
[12540
7.6%
.12540
7.6%
]12540
7.6%
r9668
 
5.9%
s8055
 
4.9%
Other values (15)23926
14.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter101888
61.9%
Other Punctuation37620
 
22.9%
Open Punctuation12540
 
7.6%
Close Punctuation12540
 
7.6%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a18442
18.1%
n14642
14.4%
p13985
13.7%
g13170
12.9%
r9668
9.5%
s8055
7.9%
f7523
7.4%
o2558
 
2.5%
i2508
 
2.5%
t2104
 
2.1%
Other values (11)9233
9.1%
Other Punctuation
ValueCountFrequency (%)
'25080
66.7%
.12540
33.3%
Open Punctuation
ValueCountFrequency (%)
[12540
100.0%
Close Punctuation
ValueCountFrequency (%)
]12540
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin101888
61.9%
Common62700
38.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
a18442
18.1%
n14642
14.4%
p13985
13.7%
g13170
12.9%
r9668
9.5%
s8055
7.9%
f7523
7.4%
o2558
 
2.5%
i2508
 
2.5%
t2104
 
2.1%
Other values (11)9233
9.1%
Common
ValueCountFrequency (%)
'25080
40.0%
[12540
20.0%
.12540
20.0%
]12540
20.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII164588
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
'25080
15.2%
a18442
11.2%
n14642
8.9%
p13985
8.5%
g13170
8.0%
[12540
7.6%
.12540
7.6%
]12540
7.6%
r9668
 
5.9%
s8055
 
4.9%
Other values (15)23926
14.5%

rico_tipo
Categorical

HIGH CORRELATION
MISSING

Distinct2
Distinct (%)0.2%
Missing11703
Missing (%)93.3%
Memory size98.1 KiB
bancario
748 
privado
89 

Length

Max length8
Median length8
Mean length7.893667861
Min length7

Characters and Unicode

Total characters6607
Distinct characters10
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowbancario
2nd rowbancario
3rd rowbancario
4th rowbancario
5th rowbancario

Common Values

ValueCountFrequency (%)
bancario748
 
6.0%
privado89
 
0.7%
(Missing)11703
93.3%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
bancario748
89.4%
privado89
 
10.6%

Most occurring characters

ValueCountFrequency (%)
a1585
24.0%
r837
12.7%
i837
12.7%
o837
12.7%
b748
11.3%
n748
11.3%
c748
11.3%
p89
 
1.3%
v89
 
1.3%
d89
 
1.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter6607
100.0%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a1585
24.0%
r837
12.7%
i837
12.7%
o837
12.7%
b748
11.3%
n748
11.3%
c748
11.3%
p89
 
1.3%
v89
 
1.3%
d89
 
1.3%

Most occurring scripts

ValueCountFrequency (%)
Latin6607
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
a1585
24.0%
r837
12.7%
i837
12.7%
o837
12.7%
b748
11.3%
n748
11.3%
c748
11.3%
p89
 
1.3%
v89
 
1.3%
d89
 
1.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII6607
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a1585
24.0%
r837
12.7%
i837
12.7%
o837
12.7%
b748
11.3%
n748
11.3%
c748
11.3%
p89
 
1.3%
v89
 
1.3%
d89
 
1.3%

xp_tipo
Categorical

HIGH CORRELATION
MISSING

Distinct2
Distinct (%)0.1%
Missing11173
Missing (%)89.1%
Memory size98.1 KiB
bancario
1228 
privado
139 

Length

Max length8
Median length8
Mean length7.898317484
Min length7

Characters and Unicode

Total characters10797
Distinct characters10
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowbancario
2nd rowbancario
3rd rowbancario
4th rowbancario
5th rowbancario

Common Values

ValueCountFrequency (%)
bancario1228
 
9.8%
privado139
 
1.1%
(Missing)11173
89.1%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
bancario1228
89.8%
privado139
 
10.2%

Most occurring characters

ValueCountFrequency (%)
a2595
24.0%
r1367
12.7%
i1367
12.7%
o1367
12.7%
b1228
11.4%
n1228
11.4%
c1228
11.4%
p139
 
1.3%
v139
 
1.3%
d139
 
1.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter10797
100.0%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a2595
24.0%
r1367
12.7%
i1367
12.7%
o1367
12.7%
b1228
11.4%
n1228
11.4%
c1228
11.4%
p139
 
1.3%
v139
 
1.3%
d139
 
1.3%

Most occurring scripts

ValueCountFrequency (%)
Latin10797
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
a2595
24.0%
r1367
12.7%
i1367
12.7%
o1367
12.7%
b1228
11.4%
n1228
11.4%
c1228
11.4%
p139
 
1.3%
v139
 
1.3%
d139
 
1.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII10797
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a2595
24.0%
r1367
12.7%
i1367
12.7%
o1367
12.7%
b1228
11.4%
n1228
11.4%
c1228
11.4%
p139
 
1.3%
v139
 
1.3%
d139
 
1.3%

cod_cetip
Categorical

HIGH CARDINALITY
MISSING
UNIFORM

Distinct291
Distinct (%)94.2%
Missing12231
Missing (%)97.5%
Memory size98.1 KiB
ECHP11
 
2
ITPO14
 
2
EGIEA0
 
2
PTAZ11
 
2
CBAN12
 
2
Other values (286)
299 

Length

Max length11
Median length6
Mean length6.453074434
Min length6

Characters and Unicode

Total characters1994
Distinct characters35
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique273 ?
Unique (%)88.3%

Sample

1st rowSAUC14
2nd rowCRA0220058X
3rd rowJSMLB5
4th rowCNRD11
5th rowCRA022000RT

Common Values

ValueCountFrequency (%)
ECHP112
 
< 0.1%
ITPO142
 
< 0.1%
EGIEA02
 
< 0.1%
PTAZ112
 
< 0.1%
CBAN122
 
< 0.1%
CART132
 
< 0.1%
MSGT232
 
< 0.1%
MSGT122
 
< 0.1%
PLSB1A2
 
< 0.1%
MSGT332
 
< 0.1%
Other values (281)289
 
2.3%
(Missing)12231
97.5%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
echp112
 
0.6%
ctee172
 
0.6%
cuti112
 
0.6%
cjen132
 
0.6%
rumoa42
 
0.6%
anem112
 
0.6%
ibpb112
 
0.6%
neoe262
 
0.6%
gasc232
 
0.6%
itpo142
 
0.6%
Other values (281)289
93.5%

Most occurring characters

ValueCountFrequency (%)
1250
 
12.5%
E175
 
8.8%
2166
 
8.3%
A134
 
6.7%
T109
 
5.5%
C107
 
5.4%
089
 
4.5%
R88
 
4.4%
S83
 
4.2%
P75
 
3.8%
Other values (25)718
36.0%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter1276
64.0%
Decimal Number718
36.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
E175
13.7%
A134
10.5%
T109
 
8.5%
C107
 
8.4%
R88
 
6.9%
S83
 
6.5%
P75
 
5.9%
N70
 
5.5%
G69
 
5.4%
B55
 
4.3%
Other values (15)311
24.4%
Decimal Number
ValueCountFrequency (%)
1250
34.8%
2166
23.1%
089
 
12.4%
347
 
6.5%
433
 
4.6%
532
 
4.5%
628
 
3.9%
725
 
3.5%
824
 
3.3%
924
 
3.3%

Most occurring scripts

ValueCountFrequency (%)
Latin1276
64.0%
Common718
36.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
E175
13.7%
A134
10.5%
T109
 
8.5%
C107
 
8.4%
R88
 
6.9%
S83
 
6.5%
P75
 
5.9%
N70
 
5.5%
G69
 
5.4%
B55
 
4.3%
Other values (15)311
24.4%
Common
ValueCountFrequency (%)
1250
34.8%
2166
23.1%
089
 
12.4%
347
 
6.5%
433
 
4.6%
532
 
4.5%
628
 
3.9%
725
 
3.5%
824
 
3.3%
924
 
3.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII1994
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1250
 
12.5%
E175
 
8.8%
2166
 
8.3%
A134
 
6.7%
T109
 
5.5%
C107
 
5.4%
089
 
4.5%
R88
 
4.4%
S83
 
4.2%
P75
 
3.8%
Other values (25)718
36.0%

escalonado
Boolean

CONSTANT
MISSING
REJECTED

Distinct1
Distinct (%)5.9%
Missing12523
Missing (%)99.9%
Memory size98.1 KiB
False
 
17
(Missing)
12523 
ValueCountFrequency (%)
False17
 
0.1%
(Missing)12523
99.9%

avista_id
Categorical

HIGH CORRELATION
MISSING

Distinct4
Distinct (%)16.7%
Missing12516
Missing (%)99.8%
Memory size98.1 KiB
10.0
9.0
22.0
11.0

Length

Max length4
Median length4
Mean length3.666666667
Min length3

Characters and Unicode

Total characters88
Distinct characters5
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row10.0
2nd row10.0
3rd row22.0
4th row22.0
5th row10.0

Common Values

ValueCountFrequency (%)
10.08
 
0.1%
9.08
 
0.1%
22.04
 
< 0.1%
11.04
 
< 0.1%
(Missing)12516
99.8%

Length

Histogram of lengths of the category

Category Frequency Plot

ValueCountFrequency (%)
10.08
33.3%
9.08
33.3%
22.04
16.7%
11.04
16.7%

Most occurring characters

ValueCountFrequency (%)
032
36.4%
.24
27.3%
116
18.2%
98
 
9.1%
28
 
9.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number64
72.7%
Other Punctuation24
 
27.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
032
50.0%
116
25.0%
98
 
12.5%
28
 
12.5%
Other Punctuation
ValueCountFrequency (%)
.24
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common88
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
032
36.4%
.24
27.3%
116
18.2%
98
 
9.1%
28
 
9.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII88
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
032
36.4%
.24
27.3%
116
18.2%
98
 
9.1%
28
 
9.1%

Interactions

Correlations

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
The dendrogram allows you to more fully correlate variable completion, revealing trends deeper than the pairwise ones visible in the correlation heatmap.

Sample

First rows

_idtiposubtipovencimentoqtdMinimataxamsg_patrocinadopatrocinadoemissorliquidezincentivadaqualificadojurosamortizacaocarenciaratingagenciaprecocorretoranrainvestirtp_dchatenable_popupurltp_mercadotirvirdcdurbdrbmrbarbprlprldrlmrlavbvlvrlprltidxrpdrppvppteqttamtotalcoreslogorico_tipoxp_tipocod_cetipescalonadoavista_id
0{'$oid': '6325e0f59491f55d25c880cb'}ATIVO REALCARTEIRA DE PRECATÓRIOS MUNICIPAIS #38/2022547 dias10000.000000236.34% CDINaNFalseCARTEIRA DE PRECATÓRIOS MUNICIPAIS #38/2022No vencimentoFalseFalse236.34VencimentoNo vencimentoNaNNaN10000.000000Hurst Capital0Hurst CapitalFalseFintechFalseTruehttps://hurst.capital/operation/960dfe3e-e7a3-4d77-bc61-f5dcf40cdb12P0.00.005473750.1200322.551235.3056.8156.810.1200322.5535.3015680.8015680.805680.8056.81CDI0.01817.0110701.41LCI/LCA 194.98% CDI35.30160000.0236.34['linear-gradient(135deg, #474747 0%,#000000 100%)']['hurst.png']NaNNaNNaNNaNNaN
1{'$oid': '6325e0f59491f55d25c880cd'}ATIVO REALCARTEIRA DE PRECATÓRIOS FEDERAIS ALIMENTARES #39/2022577 dias10000.000000235.52% CDINaNFalseCARTEIRA DE PRECATÓRIOS FEDERAIS ALIMENTARES #39/2022No vencimentoFalseFalse235.52VencimentoNo vencimentoNaNNaN10000.000000Hurst Capital0Hurst CapitalFalseFintechFalseTruehttps://hurst.capital/operation/0e0349ff-81f5-4a71-ace1-ea50c5aa9731P0.00.005773960.1196162.542235.1660.5460.540.1196162.5435.1616054.3716054.376054.3760.54CDI0.01817.4210742.11LCI/LCA 194.3% CDI35.16160000.0235.52['linear-gradient(135deg, #474747 0%,#000000 100%)']['hurst.png']NaNNaNNaNNaNNaN
2{'$oid': '6325e0f59491f55d25c880c9'}ATIVO REALCARTEIRA FEDERAL #36/2022638 dias10000.000000221.5% CDINaNFalseCARTEIRA FEDERAL #36/2022No vencimentoFalseFalse221.50VencimentoNo vencimentoNaNNaN10000.000000Hurst Capital0Hurst CapitalFalseFintechFalseTruehttps://hurst.capital/operation/97294719-f536-4c72-8a6d-3fb12841f10fP17.51113.576384380.1124962.389232.7563.6352.500.0963882.0427.4816363.2615249.695249.6952.50CDI0.01818.2410823.98LCI/LCA 182.74% CDI32.75150000.0221.50['linear-gradient(135deg, #474747 0%,#000000 100%)']['hurst.png']NaNNaNNaNNaNNaN
3{'$oid': '6325e0f59491f55d25c880ca'}ATIVO REALOPERAÇÃO PRECATÓRIO ESTADUAL - PE #37/2022790 dias10000.000000213.0% CDINaNFalseOPERAÇÃO PRECATÓRIO ESTADUAL - PE #37/2022No vencimentoFalseFalse213.00VencimentoNo vencimentoNaNNaN10000.000000Hurst Capital0Hurst CapitalFalseFintechFalseTruehttps://hurst.capital/operation/3550baf5-a7a1-4b4e-91f0-158ddda60971P0.00.007905440.1081792.296531.3280.0780.070.1081792.3031.3218006.9618006.968006.9680.07CDI0.018110.3311033.39LCI/LCA 181.05% CDI31.32140000.0213.00['linear-gradient(135deg, #474747 0%,#000000 100%)']['hurst.png']NaNNaNNaNNaNNaN
4{'$oid': '6325e0f59491f55d25c880c8'}CCBCCB - INCORPORAÇÃO IMOBILIÁRIA RESIDENCIAL - PROJETO CASA IDEAL #04/2022730 dias1000.000000189.0% CDINaNFalseCCB - INCORPORAÇÃO IMOBILIÁRIA RESIDENCIAL - PROJETO CASA IDEAL #04/2022No vencimentoFalseFalse189.00VencimentoNo vencimentoNaNNaN1000.000000Hurst Capital0Hurst CapitalFalseFintechFalseTruehttps://hurst.capital/operation/60350e38-8cb4-4fa3-83d3-25cd18491474P15.093.047305030.0959892.035227.3562.0352.720.0842221.7823.631620.271527.23527.2352.72CDI0.01819.521095.19LCI/LCA 160.65% CDI27.35150000.0189.00['linear-gradient(135deg, #474747 0%,#000000 100%)']['hurst.png']NaNNaNNaNNaNNaN
5{'$oid': '6325e0f59491f55d25c880cc'}ATIVO REALACERVO JJ - MULHERES CONCRETAS #03/2022547 dias10000.000000163.6% CDINaNFalseACERVO JJ - MULHERES CONCRETAS #03/2022No vencimentoFalseFalse163.60VencimentoNo vencimentoNaNNaN10000.000000Hurst Capital0Hurst CapitalFalseFintechFalseTruehttps://hurst.capital/operation/73620874-ecac-431d-862a-8f542c225b09P17.5639.475473750.0830891.759423.2836.5430.150.0702891.4919.3713654.1113014.643014.6430.15CDI0.01817.0110701.41LCI/LCA 134.97% CDI23.28180000.0163.60['linear-gradient(135deg, #474747 0%,#000000 100%)']['hurst.png']NaNNaNNaNNaNNaN
6{'$oid': '62f1262520821896c213ad33'}CDBNaN365 dias10000.000000IPCA +8.84%NaNFalseBANCO BTG PACTUALNo vencimentoFalseNaN8.84NaNSem carênciaNaNNaN10000.000000Banco Safra25SafraFalseBancoFalseNaNNaNP17.5359.273652510.0744211.574520.6220.5316.940.0623571.3217.0112052.9811693.711693.7116.94IPCA0.01674.2810428.14LCI/LCA IPCA + 7.29%20.62210000.08.84['linear-gradient(135deg, #00003C 0%,#242947 100%)']['safra.png']NaNNaNNaNNaNNaN
7{'$oid': '62f1262520821896c213ad10'}CDBNaN365 dias10000.000000IPCA +8.8%NaNFalseBANCO PANNo vencimentoFalseNaN8.80NaNSem carênciaNaNNaN10000.000000Banco Safra25SafraFalseBancoFalseNaNNaNP17.5358.503652510.0742751.571420.5820.4916.900.0622321.3216.9712048.5711690.071690.0716.90IPCA0.01674.2810428.14LCI/LCA IPCA + 7.26%20.58210000.08.80['linear-gradient(135deg, #00003C 0%,#242947 100%)']['safra.png']NaNNaNNaNNaNNaN
8{'$oid': '632480fb02b43e2511270f81'}CDBNaN105 dias1083.532482IPCA +10.5%NaNFalseBANCO PANNo VencimentoFalseFalse10.50NaNSem carênciabrAAAS&P1083.532482Rico25RicoFalseCorretoraFalseNaNNaNS22.513.12105720.0728501.541120.145.384.170.0567841.2015.381141.861128.7445.204.17IPCA0.01811.311097.73LCI/LCA IPCA + 8.14%20.14240000.010.50['linear-gradient(135deg, #030441 0%,#0e0e70 100%)']['rico.png']bancarioNaNNaNNaNNaN
9{'$oid': '632480fb02b43e2511270f82'}CDBNaN127 dias1069.874330IPCA +10.5%NaNFalseBANCO PANNo VencimentoFalseFalse10.50NaNSem carênciabrAAAS&P1069.874330Rico25RicoFalseCorretoraFalseNaNNaNS22.515.93127880.0728501.541120.146.625.130.0568571.2015.401140.681124.7554.885.13IPCA0.01811.601087.03LCI/LCA IPCA + 8.14%20.14230000.010.50['linear-gradient(135deg, #030441 0%,#0e0e70 100%)']['rico.png']bancarioNaNNaNNaNNaN

Last rows

_idtiposubtipovencimentoqtdMinimataxamsg_patrocinadopatrocinadoemissorliquidezincentivadaqualificadojurosamortizacaocarenciaratingagenciaprecocorretoranrainvestirtp_dchatenable_popupurltp_mercadotirvirdcdurbdrbmrbarbprlprldrlmrlavbvlvrlprltidxrpdrppvppteqttamtotalcoreslogorico_tipoxp_tipocod_cetipescalonadoavista_id
12530{'$oid': '6324d2410580e99d15bf8dfb'}LCINaN720 dias250000.00.0% CDINaNFalseBANCO INTERNo VencimentoFalseFalse0.00Vencimento720 diasA-Fitch250000.0Banco Inter19Banco InterFalseBancoFalseNaNNaNP0.00.07204940.000000.00000.00.00.00.0000000.00.0250000.0250000.00.00.0CDI0.01819.34273352.82CDB 0.0% CDI0.0250000.00.00['linear-gradient(135deg, #ff5122 1%,#ff881b 100%)']['inter.png']NaNNaNNaNNaNNaN
12531{'$oid': '6324d2410580e99d15bf8dfc'}LCINaN720 dias500000.00.0% CDINaNFalseBANCO INTERNo VencimentoFalseFalse0.00Vencimento720 diasA-Fitch500000.0Banco Inter19Banco InterFalseBancoFalseNaNNaNP0.00.07204940.000000.00000.00.00.00.0000000.00.0500000.0500000.00.00.0CDI0.01819.34546705.64CDB 0.0% CDI0.0250000.00.00['linear-gradient(135deg, #ff5122 1%,#ff881b 100%)']['inter.png']NaNNaNNaNNaNNaN
12532{'$oid': '6324d2410580e99d15bf8dfd'}LCINaN720 dias750000.00.0% CDINaNFalseBANCO INTERNo VencimentoFalseFalse0.00Vencimento720 diasA-Fitch750000.0Banco Inter19Banco InterFalseBancoFalseNaNNaNP0.00.07204940.000000.00000.00.00.00.0000000.00.0750000.0750000.00.00.0CDI0.01819.34820058.46CDB 0.0% CDI0.0250000.00.00['linear-gradient(135deg, #ff5122 1%,#ff881b 100%)']['inter.png']NaNNaNNaNNaNNaN
12533{'$oid': '6324d2410580e99d15bf8dfe'}LCINaN720 dias1000000.00.0% CDINaNFalseBANCO INTERNo VencimentoFalseFalse0.00Vencimento720 diasA-Fitch1000000.0Banco Inter19Banco InterFalseBancoFalseNaNNaNP0.00.07204940.000000.00000.00.00.00.0000000.00.01000000.01000000.00.00.0CDI0.01819.341093411.28CDB 0.0% CDI0.0250000.00.00['linear-gradient(135deg, #ff5122 1%,#ff881b 100%)']['inter.png']NaNNaNNaNNaNNaN
12534{'$oid': '6324d2410580e99d15bf8e18'}LCINaN360 dias50.00.0% CDINaNFalseBANCO INTERNo VencimentoFalseFalse0.00Vencimento360 diasA-Fitch50.0Banco Inter19Banco InterFalseBancoFalseNaNNaNS0.00.03602460.000000.00000.00.00.00.0000000.00.050.050.00.00.0CDI0.01814.5552.27CDB 0.0% CDI0.0250000.00.00['linear-gradient(135deg, #ff5122 1%,#ff881b 100%)']['inter.png']NaNNaNNaNNaNNaN
12535{'$oid': '6324d2410580e99d15bf8e19'}LCINaN360 dias250000.00.0% CDINaNFalseBANCO INTERNo VencimentoFalseFalse0.00Vencimento360 diasA-Fitch250000.0Banco Inter19Banco InterFalseBancoFalseNaNNaNP0.00.03602460.000000.00000.00.00.00.0000000.00.0250000.0250000.00.00.0CDI0.01814.55261368.52CDB 0.0% CDI0.0250000.00.00['linear-gradient(135deg, #ff5122 1%,#ff881b 100%)']['inter.png']NaNNaNNaNNaNNaN
12536{'$oid': '6324d2410580e99d15bf8e1a'}LCINaN360 dias500000.00.0% CDINaNFalseBANCO INTERNo VencimentoFalseFalse0.00Vencimento360 diasA-Fitch500000.0Banco Inter19Banco InterFalseBancoFalseNaNNaNP0.00.03602460.000000.00000.00.00.00.0000000.00.0500000.0500000.00.00.0CDI0.01814.55522737.03CDB 0.0% CDI0.0250000.00.00['linear-gradient(135deg, #ff5122 1%,#ff881b 100%)']['inter.png']NaNNaNNaNNaNNaN
12537{'$oid': '6324d2410580e99d15bf8e1b'}LCINaN360 dias750000.00.0% CDINaNFalseBANCO INTERNo VencimentoFalseFalse0.00Vencimento360 diasA-Fitch750000.0Banco Inter19Banco InterFalseBancoFalseNaNNaNP0.00.03602460.000000.00000.00.00.00.0000000.00.0750000.0750000.00.00.0CDI0.01814.55784105.55CDB 0.0% CDI0.0250000.00.00['linear-gradient(135deg, #ff5122 1%,#ff881b 100%)']['inter.png']NaNNaNNaNNaNNaN
12538{'$oid': '6324d2410580e99d15bf8e1c'}LCINaN360 dias1000000.00.0% CDINaNFalseBANCO INTERNo VencimentoFalseFalse0.00Vencimento360 diasA-Fitch1000000.0Banco Inter19Banco InterFalseBancoFalseNaNNaNP0.00.03602460.000000.00000.00.00.00.0000000.00.01000000.01000000.00.00.0CDI0.01814.551045474.07CDB 0.0% CDI0.0250000.00.00['linear-gradient(135deg, #ff5122 1%,#ff881b 100%)']['inter.png']NaNNaNNaNNaNNaN
12539{'$oid': '6325375055c20d5626f1aee5'}CDBNaN91 dias500.00.02% CDINaNFalseBANCO MASTERNo vencimentoFalseFalse0.02Vencimento91 dias diasBBB-Fitch500.0Nova Futura16Nova FuturaFalseCorretoraFalseNaNNaNP22.50.091620.000010.00020.00.00.00.0000080.00.0500.0500.00.00.0CDI0.01811.13505.64LCI/LCA 0.02% CDI0.0250000.00.02['linear-gradient(135deg, #005e77 1%,#007298 100%)']['novafutura.png']NaNNaNNaNNaNNaN